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A TEST OF A STATISTICAL LEARNING THEORY MODEL FOR 
TWO-CHOICE BEHAVIOR WITH DOUBLE 
STIMULUS EVENTS '? 


NORMAN H. ANDERSON? AND DAVID A. GRANT 


The University of Wisconsin 


This paper reports three experi- 
ments bearing on a mathematical 
model for two-choice behavior based 
on statistical learning theory (3, 4). 
The model is closely related to that 
used by Estes and Straughan (5) for 
the case in which exactly one of the 
two choices is correct on each trial. 
They assume the existence of a popula- 
tion of stimulus elements whose de- 
fining properties are as follows. Each 
element is either in the state of being 
conditioned to the response class A, 
(predicting event E,), or else in the 
state of being conditioned to response 
class A, (predicting event E,). On 
each trial, the signal to respond acti- 
vates a random sample containing the 
mean proportion, 6, of the population 
of stimulus elements. The response 
ig then determined probabilistically 
according to the proportion of ele- 
ments in the sample which are in each 
of the two states. At the end of the 
trial, all elements in the sample are 

This paper is based on the first author's 
M.S. thesis (1) which was greatly facilitated by 
a National Science Foundation fellowship. 

? Thanks are due to Nancy Angrist Myers 


who was a co-worker in the conducting of Exp. 2. 
* Now at Yale University. 


conditioned to the response class 
which was in fact correct on that 
trial, independent of S’s actual re- 
sponse. ‘The sample is then returned 
to the population and the process is 
repeated with a new random sample 
on the next trial, although there ap- 
pears to be some necessity for assum- 
ing that successive samples are not 
independent. 

The model tested here extends the 
above situation by allowing the occur- 
rence, in addition to EF, and Ey,, of 
both FE; and Es, or neither FE, nor Eo, 
on various trials. The occurrence of 
neither E,; nor E» will be denoted as 
event E». The occurrence of both 
E, and FE, will be denoted as event 
E3. Eo and FE, will be called double 
events; FE; and FE, will be called single 
events. A number of experiments (2, 
9, 12, 13, 14) have been performed 
with this experimental situation, and 
mathematical models for the case 
where Eo, Ey, and EF», but not Ey, may 
occur, have been proposed by Neimark 
(12), and by Atkinson (2). Assum- 
ing that conditioning takes place in the 


way assumed _in the model _ used by 


Estes and Straughan on trials where 
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This paper reports three experi- 
ments bearing on a mathematical 
model for two-choice behavior based 
on statistical learning theory (3, 4). 
The model is closely related to that 
used by Estes and Straughan (5) for 
the case in which exactly one of the 
two choices is correct on each trial. 
They assume the existence of a popula- 
tion of stimulus elements whose de- 
fining properties are as follows. Each 
element is either in the state of being 
conditioned to the response class A, 
(predicting event E,), or else in the 
state of being conditioned to response 
class Ay (predicting event E,). On 
each trial, the signal to respond acti- 
vates a random sample containing the 
mean proportion, 6, of the population 
of stimulus elements. The response 
is then determined probabilistically 
according to the proportion of ele- 
ments in the sample which are in each 
of the two states. At the end of the 
trial, all elements in the sample are 

This paper is based on the first author’s 
M.S. thesis (1) which was greatly facilitated by 
a National Science Foundation fellowship. 

* Thanks are due to Nancy Angrist Myers 
who was a co-worker in the conducting of Exp, 2. 

* Now at Yale University. 


conditioned to the response class 
which was in fact correct on that 
trial, independent of S’s actual re- 
sponse. The sample is then returned 
to the population and the process is 
repeated with a new random sample 
on the next trial, although there ap- 
pears to be some necessity for assum- 
ing that successive samples are not 
independent. 

The model tested here extends the 
above situation by allowing the occur- 
rence, in addition to EF, and Ez, of 
both FE, and E., or neither FE, nor Eo, 
on various trials. The occurrence of 
neither EF; nor FE. will be denoted as 
event Ey». The occurrence of both 
E, and FE, will be denoted as event 
Ey. Es and E, will be called double 
events; E,; and FE, will be called single 
events. A number of experiments (2, 
9, 12, 13, 14) have been performed 
with this experimental situation, and 
mathematical models for the case 
where Eo, Ey, and E», but not Ey, may 
occur, have been proposed by Neimark 
(12), and by Atkinson (2). Assum- 
ing that conditioning takes place in the 
way_assumed_in_ the model used by 


Estes and Straughan on trials where 
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single events occur, the critical point 
in the model is the disposition of the 
random sample on trials where double 
events occur. If Es occurs, it is 
hazardous to assume that the sample 
elements become conditioned to the 
response just made since, although it 
was correct, S can see that the other 
choice would have also been correct. 
A similar argument holds if Ey occurs. 
Accordingly, a sharing parameter, a, 
is introduced for each double event 
which specifies what proportion of the 
sample is conditioned to the response 
just made (or, equivalently, on what 
proportion of the trials the entire 
sample is so conditioned). 


Proceeding in the same way as in the 
Estes-Burke-Straughan (4,5) develop- 
ment, one obtains the equation of the 
mean learning curve* 


p(n) = p(~) — [p(~) 
— p(0)](1 — B)", (1) 
where 


p(n) = probability of response A; on 
trial n, 


p(0) = initial probability of response 
i 


wo(1 — ao) + ma(l — as) + 
1 + (1 — 2a) + #3(1 — 2as) 
1— PB = (1 — 0) + Ofo(2ay — 1) 

+ m3(2ay — 1)] 


and where 


p(*) = 


x; = probability of event Fi, i = 0, 1, 
2,3 


a; = sharing parameter associated with 
event E,, j = 0, 3. 


It is seen that this equation is again a 
growth curve with asymptote, p(), de- 
pending on the m, and a,, and with learn- 
ing rate, 8, depending on the m and a; as 
well as on 6, which is the learning rate in 
the case where only the two single events 
occur. 


* Mathematical details, as well as the treat- 
ment of the general case of r responses and 2° 
events may be found in Anderson (1). 
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To obtain an empirical estimate of the 
learning rate, knowing p(*) and p(0), 
Equation 1 is summed over N trials to 
yield 


N— 


LX p(n) = Np(~) — [p(~) 


n=0 
1 — (1 — §)* 
p(0)}(1 — 6) += (i B)’ 
Using the data to estimate the sum on 
the left, 1 — B is obtained by successive 
approximations. 
If the a; chance to assume the value 
unity, one obtains 


(2) 


us) tr 
oad te im mn ro)| 


X (1 — O(r, + #2) }” (3) 


which is independent of the individual 
values of 9 and 3, although not of their 
sum.® For x, + m2 = 1, so that neither 
of the double events, Eo or Es, occurs, 
this reduces to the equation of Estes and 
Straughan. 

Using the notation p(x | y) to denote the 
probability of x given y, and considering 
only trials on which one of the single 
events occurs, the quantity a defined by 


a = p(A;|E,) — p(Ai| Es) (4) 


is an estimate of 6 within the model, as 
was shown by Estes and Straughan. 
Without reference to the model, one may 
interpret a as a measure of stimulus con- 
trol over S. Its values lie between —1 
and +1. It assumes the value 0 if S’s 
actions are independent of the stimulus 
sequence, the value +1 if S always makes 
next that response which was correct on 
the preceding trial, and the value —1 if 
S always makes next that response which 
was incorrect on the preceding trial. 
Similarly, conditional probabilities of 
response may be considered for those 


*It should perhaps be specifically noted that 
this is the appropriate generalization of the 
“matching solution” model (for these values of 
the sharing parameters). For example, if the 
event probabilities are .7 and .3, and if the events 
are independent, then the asymptote predicted by 
the model is .84, not .70. 
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trials on which the double events occur. 
Defining 4 as 


6 = p(A; on trial m + 1/E, 
and A, on trial m) 
— p(A, on trial » + 1|E; ; 
and A; on trial m),j = 0,3, (5) 


then, within the model, 4 is an estimate 
of 0(2a; — 1), and if a; = 1, this gives 
another estimate of 6. Without refer- 
ence to the model, 4 can be interpreted 
as a measure of perseverative tendency 
in the face of a noninformative event. 
Its values lie between —1 and +1. If 
S has alternation tendencies, then this 
quantity will be negative; if § has per- 
severative tendencies, then the difference 
will be positive. 

Three experiments are reported here. 
The first is a pilot study which was run 
to gain preliminary information about 
behavior in this situation. The second 
is primarily methodological, investigat- 
ing the effects of instructions and trial 
rate. The third experiment was de- 
signed specifically to test the model; its 
design follows closely the experiment of 
Estes and Straughan (5), and includes 
two “control” groups whose function is 
to calibrate the procedure used here 
against their results. 


Mertnop 
Apparatus 


The apparatus, slightly adapted from that 
used in previous work (11, 14), was the same 
for the three experiments. ‘Two 6-w. opal lights, 
mounted 10 in. apart horizontally on a vertical, 
flat black, 15 X 20-in. panel, formed the stimulus 
display. These lights were activated by a 
punched tape fed through a Western Union tape 
transmitter. An audio oscillator generated an 
800-cycle tone which served as S’s signal to 
respond. A control mechanism consisting of a 
6-rpm synchronous motor and associated relays, 
cams, and microswitches governed the presenta- 
tion of the tone and the sequence of stimulus 
lights. 

A row of four partially separated Ss’ booths 
was placed 9 ft. from the stimulus display. 
Each booth contained a small box with two non- 
self-releasing keys, corresponding to the two 
lights, which Ss pushed down and then up in 
making their predictions. Responses were re- 
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corded automatically with an Esterline-Angus 
recorder. 

Except for the stimulus display and Ss’ 
booths, all equipment was placed on E's desk 
which sat behind Ss’ booths and from which F 
monitored the course of the experiment. 


Procedure 


The Ss were volunteers from elementary 
psychology classes who obtained class points for 
their services. They were randomly assigned, 
in groups of two, three, or four, to the various 
experimental conditions. No S was used more 
than once. 

In each experiment, Ss were seated in the 
booths and given instructions stating that S 
should always press exactly one key each time the 
tone sounded in order to indicate which light he 
thought would flash on next; that if both lights 
flashed, either choice would be counted correct; 
but that, if neither light flashed, this would not 
be counted against S. 

Except for a few preliminary trials in Exp 
2 and 3, the successive events of the stimulus 
sequence were given without rest. ‘The events 
occurred randomly, according to the probabilities 
designated below, subject to balancing in blocks 
of 20 trials for Exp. 1 and 2, and in blocks of 50 
trials for Exp. 3. 

Four independent event sequences were used 
in Exp. 1, and three independent sequences in 
each different probability condition of Exp. 2 
and 3. In Exp. 1 and 2, the left light was 
always the more frequent. In Exp. 3, right and 
left were counterbalanced, and double events 
occurred on the same trials for corresponding 
groups. 

Three experiments differ in instructional de- 
tail, length and composition of the stimulus 
sequences, and in trial rate as follows. 

Experiment 1.—Following 12 initial trials in 
which each of the three possible stimulus events 
occurred four times in random order, 220 trials 
were given with these event probabilities: both 
lights, .5; left light alone, 4; and right light 
alone, .1. ‘The trial duration was 10 sec. divided 
as follows: signal tone, 1 sec.; waiting time 
(during which responses were to be made), 3 sec. ; 
stimulus lights on for 3 sec.; intertrial interval, 
3 sec. Instructions, which were read by E, were 
similar to Instructions D of Exp. 2. There 
were 35 Ss in this experiment. , 

Experiment 2.—The main variables were trial 
rate and instructions. ‘The four sets of instruc- 
tions, denoted by A, B, C, and D, had identical 
initial parts which gave operating instructions 
and told Ss that their job was to make correct 
predictions. The four sets differed in their 
concluding parts which were, in brief: A: your 
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job is to outguess the experimenter; B: no addi- 
tional content (besides the same exhortation to 
do well that was contained in the other three 
sets); C: this is an experiment in learning; you 
should try to learn how to improve the accuracy 
of your predictions; D: this is an experiment in 
learning; you should try to learn how to improve 
the accuracy of your predictions; there is no 
fixed pattern in the way the lights come on; act 
sensibly; if you can’t get them all, get as many 
as you can. All instructions were presented 
with a tape recorder 

Each trial was divided thus: signal 
(during which responses were to be made), 2 sec 
waiting time, 1.2 sec.; stimulus event, .& sec.; 
intertrial interval, 1, 3.5, or 6 sec. 


tone 


’ 


These values 
of the intertrial interval gave trial rates of 5, 
7.5, and 10 sec. Groups run at these speeds 
received 360, 300, and 240 trials, respectively. 

The experiment splits into two parts, denoted 
by Exp. 2.1 and Exp. 2.2, according to the event 
probabilities used, The two parts were run 
Experiment 2.1 was a 4X 3 
design formed from the four sets of instructions 
and the three trial rates. ‘There were 234 Ss, 
18 in each cell, except that 36 Ss were run using 
Instructions C and the 10-sec. trial rate, 
Kighteen of these were chosen at random for the 
analysis, but all 36 are 
Event probabilities were: 
light, .50., 

Experiment 2,2 was a 1 X 3 design formed 
from the three trial rates and Instructions C. 
A total of 71 Ss was used. Event probabilities 
were: left light, .75; right light, .50. 

All Ss in Exp. 2 these four pre 
liminary trials: left light alone, right light alone, 
neither light, both lights, in that order. Follow- 
ing these trials, the apparatus was stopped and 
Ss were asked if they had any questions. Ques 
tions pertaining to the nature or purpose of the 
experiment were answered by paraphrasing the 
instructions. 


simultaneously. 


used in the graphs. 
left light, .90; right 


receiy ed 


TABLE 1 


Group Desicnations ANd Event 
Propapitities or TRIALS 
101-400 or Exe. 3 


Group 70 


* 
* 
~ 


5 0 
30 0 
30-0 30 
30-3 0 
AS-O AS 
45-3 0 
HOD | oO 
3 | 0 


cao ead ome tm amet tend gd Ome 
Su 
wre en x 


AAA 


Note.--#i = Probability of Event Ey. 
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SUCCESSIVE BLOCKS OF 10 TRIALS 


me 
Fic. 1. Mean percentage of right-key presses as 
a function of trial blocks in Exp. 1. 


Experiment 3.—All eight groups in Exp. 3 
were run using the 5-sec. trial rate and Instruc- 
tions C of Exp. 2 (except for small changes in the 
wording for the two groups receiving only single 
read by E. Six 
groups formed a 3 X 2 design with probability 
(.30, .45, .60) and type (neither light, both lights) 
of double event as variables. Two groups re- 
ceived only the two single events (right light 
alone, left light alone) and served as control 
groups to calibrate the present procedure against 
that of Estes and Straughan (5). 

All groups were given three preliminary trials: 
both lights, neither light, both lights, in that 
order. The apparatus was then stopped and 
questions were answered as in Exp. 2. There 
followed 424 trials without pause. The first 24 
trials were adaptation trials in which the two or 
three events appropriate to each group appeared 
equally often. All groups were treated alike 
(wo = 0, wy = 0, wm; = .75, and wz = .25) on the 
next 100 trials in which only single stimulus 
The groups treated 
differently on the last 300 trials, but in all cases, 
the light previously the more frequent became 
the less frequent. ‘The event probabilities and 
group designations for the last 300 trials are 
given in Table 1. Right and left were counter- 
balanced so that F, refers to the event occurring 
.75 of the time on the first 100 trials. 


events). Instructions were 


events occurred. were 


RESULTS 


Experiment 1.—The data of Exp. I 
are summarized in Fig. 1 which plots 
mean percentage of right-key presses 
as a function of trial number, omitting 
the 12 initial trials. The horizontal 
line at 20% is the minimum asymptote 
allowed by the model, corresponding 
to a value of the sharing parameter, 
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The mean curve starts sig- 
nificantly (.09) above the 50° level 
and descends to the neighborhood of 
the theoretical minimum. Accord- 
ingly, a; would be expected to have a 
value close to I. 


as = 1. 


At the same time, 
however, the terminal performance 
suggests that the curve may still be 
decreasing, that asymptotic perform- 
ance has not been reached, and, con- 
sequently, that the model does not 
describe the data. Therefore, it was 
decided to perform Exp. 2 primarily 
to test effects of instructions and trial 
rate in order to obtain more informa- 
tion as to the limits of applicability 
of the model. 

Experiment 2.—Figure 2 plots mean 
percentage of right-key presses as a 
function of trial number for Exp. 2. 
The lower three curves represent the 
data of Exp. 2.1 pooled over instruc- 
tions which had, negligible effect. 
Starting somewhat the 50% 
level, the curves descend steeply to an 
apparently stable level a little below 
10%, with the group doing 
poorest, the 7.5-sec. group best, and 
the 10-sec. group lying 
Over Trials 181-240, the response 
rates are 9.8, 6.3, and 7.9% for the 


above 


»-seCc. 


between. 


5-, 7.5-, and 10-sec. groups, respect- 
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Fic. 2. Mean percentage of right-key presses 
asa function of trial blocks in Exp.2. The three 
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upper curves are from Exp ; the three lower 


curves are from Exp. 2.1. 
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NUMBER RIGHT-KEY PRESSES 
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Fic, 3 


making given numbers of right-key 


Histograms of the number of Ss 


presses on 
Trials 141-240, inclusive, for the three trial rates 
of Exp. (RR 


ively. ‘The three upper curves repre- 
sent the data of Exp. 2.2. Here the 
curves fall in a natural order with the 
faster trial rate groups doing poorer. 
Mean response rates over Trials 181 
240 are 32.6, 26.8, and 18.8°%% with 
standard errors (based on different 
N’s) of 4.1, 3.7 and 1.5% for the 5.- 
7.5-, and 10-sec. groups, respectively, 

Analysis of variance on Exp. 2.1 of 
total scores on Trials 1-240 showed 
that only trial rate was significant 
(.05). It had been intended to per- 
form a similar analysis over ‘Trials 
141-240 in order to test rate effects on 
asymptotic performance. However, 
the extreme skewing of the scores (see 
Fig. 3) precluded this, so the test was 
made on total scores over Trials 41 
240. Nothing approached  signifi- 
cance in this test. 

Analysis of variance for Exp. 2.2 
over Trials 1-240 showed that trial 
rate was significant (.05). A similar 
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Mean percentage of A, responses as a function of trial blocks in Exp. 3. 
The curves are theoretical; see text. 


analysis over Trials 141-240 showed 
that trial rate was again significant 
(01). 

Although the experiment was not 
designed primarily to test agreement 
between terminal performance and 
theoretical asymptote, some informa- 


tion is available on this point. Per- 
forming t tests on mean scores of the 
three groups of Exp. 2.2 over the last 
60 trials for each group showed that 
only the 10-sec. group departed sig- 
nificantly (.01) from the minimum 
theoretical asymptote obtained by 
setting both sharing parameters equal 
to unity. 

Similar tests for Exp. 2.1 are difficult 
to make because of the skewed distri- 
butions of the scores. However, more 
incisive information can be gained by 
considering these very distributions 
which are plotted in Fig. 3. This 
figure lists, for each of the three rate 


conditions, the number of Ss who 
made given numbers of right-key 
presses in Trials 141-240, inclusive. 
For example, in the 10-sec. condition, 
20 Ss did not press this less profitable 
key even once in these 100 trials while 
there were 9 Ss who made the less 
profitable response more than 20 
times in the same period. The distri- 
butions for the various rate conditions 
are quite similar although it is inter- 
esting to see the consistent differences 
of Fig. 2 reflected in the number of Ss 
who made more than 20 right-key 
presses. 

The model predicts an approxi- 
mately binomial distribution of indi- 
vidual scores about the asymptotic 
mean (5), but it is clear from inspec- 
tion of Fig. 3 that the empirical dis- 
tributions do not conform to expecta- 
tion. Incomplete x*’s, based on the 
contribution of the single leftmost 
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rectangle, already gave a significant 
(.01) departure from binomiality for 
each of the three histograms. 

Experiment 3.—The mean learning 
curves for the eight groups are shown 
in Fig. 4 as a function of trial number, 
ignoring the 24 adaptation trials. 
The groups start from about the 50% 
level on the first block of trials, in- 
crease to about the 75% level by 
Trial 100, reverse their preferences as 
the formerly less frequent light be- 
comes the more frequent, and then 
settle down to a more or less steady 
rate of responding. 

Two analyses of variance were per- 
formed on the 3 X 2 design of this 
experiment, one for total A; responses 
over Trials 101-400, one for total A, 
responses over Trials 301-400. Right 
vs. left, having been kept orthogonal to 
the two main variables, was included 
as a factor in the analysis. In both 
tests, only frequency of double event 
was significant (.01). Neither of the 
other variables, nor any of the inter- 
actions approached significance. In 
particular, it may be concluded that 
type of double event had little or no 
effect on number of A; responses. 
The error terms from these analyses 
are used in the tests of the theoretical 
curves given in Table 3. 

The theoretical curves of Fig. 4 
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were fitted as follows. Estimates of 6 
for each group were obtained using 
Equation 2 transformed to yield an 
equation similar to Equation 6 of 
Estes and Straughan (5). Here p() 
=m, = .75, and 8 = 6, since only 
single events are involved. The ini- 
tial probability was estimated from 
the first block of 20 trials. Estimates 
of the sharing parameters were ob- 
tained by substituting the asymptotic 
response rates of .32 and .30 for 
Groups .45-0 and .45-3, respectively, 
together with the appropriate values 
of the x, in the expression for p() 
below Equation 1. This yielded the 
estimates, ap = .84, and a; = .92. 
These values are close to the unit 
values suggested by the previous 
experiments, and by the work of Nei- 
mark (12). For simplicity, each shar- 
ing parameter was set equal to l, a 
procedure which does not materially 
affect the over-all goodness of fit 
considerations. 

With values of @ and a; at hand, 
Equation | was used to plot the the- 
oretical curves over Trials 101-400 
by starting counting on n at Trial 101. 
The initial probability, p(0), over this 
stretch was given the a priori value of 
.75, the theoretical asymptote over the 
first 100 trials. Since the curves for 
the paired groups never differed by 


TABLE 2 


Constants or Equation 2 Usep to Catcutate Tueoreticat Curves ror Exe. 3 


Group 














Learning Rates 
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101-200 
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027 067 
025 068 
022 037 
020 037 
021 026 
026 028 
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TABLE 3 


Comparison or Prepictep anp Onserven Mean Tora A; Responses ror Exp. 3 


Trials 301-400 
Group 


Obs. 


| 
| 


=|? 


15 | 13.1 
0 29.1 
0-0 20.0 
30-3 21.4 
ASO i1.9 
45-3 29.9 
DO 494 
A-3 49.8 


NmMMNMN NNW 
y ran 





* Predicted and observed significantly different, P « 
** Predicted and observed significantly different, P < 


more than 3% and were generally 
much closer, only the curves for the 
groups receiving EK» are shown, except 
for Group .30-3 over Trials 1-100. 
The constants of the curves and the 
learning summarized in 
Table 2. 

A first test of agreement between 
predicted and obtained asymptotic 
levels of response was made by com- 


rates are 
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Fic. 5. Mean conditional probability differ 


ence of Equation 4 as a function of trial blocks in 
Exp. 3. 


J 
Tris 01-400 
Trials 101 Predicted 
from New @ 
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58.3* 

83.9" 

85.6* 
103.7 
105.5 
130.4" 
127.2** 


125.8** 
125.4** 
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paring predicted and obtained num- 
bers of A, reponses over Trials 301 

400. These tests are summarized in 
Columns 2-4 of Table 3. Only for 
Groups .60-0 and .60-3 is there a sig- 
nificant discrepancy. A more strin- 
gent test is the comparison of total 
responses over Trials 101-400 as pre- 
dicted and observed. The results of 
these tests are given in Columns 5-7 
of Table 3. The values are in the 
proper order for the most part but 
show large absolute discrepancies. 

Inspection showed that the data 
points tended to fall below the curves 
on Trials 101-140 indicating that the 
6 values used were too small. To 
check on this, new values of 6, given in 
the last column of Table 2, were esti- 
mated from Trials 101-200 and used 
to calculate total responses on Trials 
101-400. These values are given in 
the last column of Table 3. These 
second estimates of @ are uniformly 
larger than the original estimates, and 
the agreement between predicted and 
observed numbers of responses is con- 
siderably better. 

As stated in the introduction, the 
difference of the conditional probabili- 
ties, a, of Equation 4 gives a measure 
of stimulus control over S’s respond- 
ing. Figure 5 plots a as a function of 
trial number. There is a general 
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modest increase from the first half 
to the second half of Trials 1-100 
marred only by the anomalous be- 
havior of Group .30-3. The groups 
diverge radically on Trials 101-200 
with all except Groups .60-0 and 
.60-3 showing considerable increases. 
Group .30 then continues at a high 
level whereas Group .15 shows a con- 
siderable decrease in the later trial 
blocks. For the 3 K 2 design, the 
tendency is for the groups differing 
only on type of double event to be 
equal initially but to diverge pairwise 
so that on the last block of 100 trials, 
the Eo groups tend to lie above the cor- 
responding E, groups, the difference 
being near zero for Groups .30-0 and 
.30-3 and increasing with increasing 
frequency of double event. 

These statements are made more 
precise by the application of Grant’s 
(6) extension of Alexander’s test for 
trend over the last three blocks of 100 
trials, as summarized in Table 4. For 
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the two control groups, Groups .15 
and .30, line A indicates significant 
linear and quadratic components for 
the data of the two groups considered 
together. Line B that the 
means for these two groups differ 
over these 300 trials while Line C indi- 
cates that the groups differ in the 
slope of their curves. 

The right-hand colums of Table 4 
give the same analysis for the 3 & 2 
design. Here a further breakdown 
into variance associated with the 
factors in the design is possible. Line 
A indicates that there is no marked 
trend in the 
whole. Line B shows that the group 
means over 300 trials do 
differ significantly. Line C indicates 
significant the linear 
and quadratic components of the 
group curves. For both components, 
group differences arise from frequency, 
but not type, of double event. At the 
same time, there is a significant source 


shows 


data considered as a 


these not 


differences in 


TABLE 4 


Trenp ANnAcysis For ConpiTIonaL Prosasitity Dirrerence or Equation 4 (See Fic. 5 
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Source Term 


A. Over-all trend 
1. Linear 
2. Quadratic 
B. Between-group means 
a Frequency 
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C. Between-group trends 
1. Linear 
a. Frequency 
b. Type 
c. Interaction 
2. Quadratic 
a. Frequency 
b. Type 
c. Interaction 
D. Between ind. means 
E. Between ind. trends 
1. Linear 
2. Quadratic 
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TABLE 5 


Perseverative TenpENCIES FOR THE 
ExreximentaL Groups or Expr. 3, 
as CALCULATED From Equation 5 


Trials 


351- 
400 


M4 
AO 
48 
35 
A6 
A7 























of variation in group slopes arising 
from the interaction of frequency and 
type of double event. This inter- 
action, because of the lay of the points 
on Trials 101-200, reflects itself as an 
effect of type of double event in the 
last trial block which was found to be 
significant (.05) in an analysis of the 
scores for that block. Finally, Line 
D indicates significant individual dif- 
ferences in mean scores. 

The second conditional probability 
difference, b, of Equation 5, gives a 
measure of perseverative tendency 
when responses are followed by the 
uninformative double events. The 
group data for b have been computed 
for the last six blocks of 50 trials, and 
are listed in Table 5. The b values 
are uniformly positive, of the order of 
45, indicating that a strong persever- 
ative tendency exists. There do not 
appear to be any marked differences 
between groups or between trial 
blocks. 

Values of a and b were also com- 
puted for the 24 adaptation trials. 
The values of a ranged from —.12 to 
.07 with a mean of —.01 for the eight 
groups, a result which is consistent 
with the general increases in Fig. 5. 
The b value for the three Eo groups 
combined was .12, which was sig- 
nificantly (.01) above 0 by a x? test. 
For the three E, groups combined, the 
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b value was .06 which was not sig- 
nificantly above 0. These values 
being positive, the implication is that 
there is little or no initial tendency 
to alternate from the last response. 


Discussion 


Effect of trial rate-—In Exp. 2, a weak 
effect of trial rate was found with the 
more extreme light frequencies of 90% 
vs. 50%, while a pronounced rate effect 
was obtained for the less extreme fre- 
quencies of 75% vs. 50%. This is con- 
sistent with the results of earlier work 
(7,10), in which no rate effects were 
found for 100% vs. 0%, and suggests that 
rate effects begin to appear as the situ- 
ation is made more complicated. 

Effect of instructions.—Failure to find 
any noticeable effect of instructions im- 
plies that the behavior is so much the 
more stable. Caution should be used in 
generalizing this conclusion since the 
results on trial rate raise the possibility 
that instructions may prove significant 
when the light frequencies are less ex- 
treme than the 90% vs. 50% used in 
Exp. 2.1. 

Individual differences.—The wide de- 
parture of the distributions of Exp. 2.1 
(Fig. 3) from expectation implies that 
the model in its present simple form does 
not apply to the case where relatively 
extreme values of the event frequencies 
are used. Furthermore, the results for 
the 5-sec. group which ended up very 
close to the theoretical asymptote of 10% 
underscore the remark of Estes and Burke 
(4) that agreement between theoretical 
and predicted learning curves is only a 
first and sometimes insensitive test of the 
adequacy of the model. 

Loosely speaking, it would seem that 
those Ss who gave no nonoptimal re- 
sponses were acting rather differently 
from say, those who gave more than 20% 
nonoptimal responses. If so, it follows 
that a revised model should allow room 
for differences in individual strategies. 

Number of trials.—Inspection of Fig. 4 
indicates that Groups .60-0 and .60-3 
tend to follow the theoretical curves ex- 
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cept for the last 100 trials. However, no 
comparable tendency to “break” ap- 
pears in the data for the other six groups 
despite the large number of trials. 
Asymptote predictions—Except for 
Groups .60—0 and .60~3, satisfactory pre- 
dictions of asymptotic performance in 
Exp. 3 were obtained by setting the 
sharing parameters equal to 1. For 
these two exceptional groups, the differ- 
ence in the total frequencies of the two 
lights is only 10% and the results suggest 
a systematic error in the direction of too 
low a predicted asymptote when the 
differential light frequency is small. 
Estimates of learning rates.—In Exp. 3, 
it had been hoped that the 24 initial 
trials would wash out any initial key pre- 
ferences, negative recency effects, or al- 
ternation tendencies which may have 
existed so that p(0) could be assigned 
the a priori value of .5, and so that the 
estimates of @ from Trials 1-100 would 
be applicable to Trials 101-400. Obvi- 
ously this was not the case, and the readi- 
ness with which the groups reverse their 
key preferences after Trial 100 is in 
marked contrast to the reluctance with 
which they leave the 50% level at the 
beginning of the experiment. This in- 
crease in 6 rendered the predicted curves 
for Trials 101-400 incorrect. When 
learning rates were estimated from Trials 
101-200, considerably more satisfactory 
fits were obtained. While this indicates 
that the model will give a fairly good de- 
scription of the mean learning curves as 
long as the differential light frequency is 
not too small, this is a weaker form of 
validation than had been hoped for. 
Unfortunately, it is not clear from the 
present data how to get appropriate esti- 
mates of @ without using the very data 
to be predicted. The observed increases 
in 6 may have arisen from continued ad- 
aptation to the situation, or from previous 
experience with a change in the identity 
of the more frequent light. A further 
possibility, indicated by Neimark (12), 
is that the value of @ increases with in- 
creases in the magnitude of the change in 
light frequencies. The present data are 
not inconsistent with this idea. 
Conditional probabilities —Perhaps the 


315 


most interesting results are the differences 
of the conditional probabilities of re- 
sponse as given in Fig. 5 and Table 5. 

Both the increase in a over the early 
blocks of trials, as well as the tendency 
for a to overestimate 6, which were found 
here, have been observed by Estes and 
Straughan. They conjectured that these 
effects arose from nonindependence of 
successive samples. Although this ap- 
pears to be a reasonable interpretation 
for the data of Trials 1-100, it is not 
clear how it would handle the differences 
between group means and trends over 
Trials 101-400. Finally, it should be 
noted that the latter data indicate an 
effect of type of double event, although 
no sign of this appears in the mean learn- 
ing curves. 

According to the model, the 4 values 
should also give an estimate of 6. It is 
seen that they are also considerably too 
large. Moreover, they do not agree 
with the a values, either in average mag- 
nitude, or in change with trials. 

Without reference to the model, these 
two conditional probability differences 


have straightforward psychological inter- 
pretations—a as a measure of stimulus 
control exerted by immediately preceding 
single events (possibly confounded, how- 
ever, with the influence of more remote 
events), and 4 as a measure of persevera- 


tive tendency after double events. The 
results of Hake and Hyman (8) indicate 
that further information will be obtained 
when response probabilities conditional 
upon more remote stimulus events are 
considered. The present results suggest 
that such analyses give more precise 
knowledge of the immediate determi- 
nants of behavior, as well as more incisive 
tests of model and theory, than does the 
study of mean performance curves. 

Finally, it should be noted that the 4 
values may well depend on the physical 
components of the response system. Al- 
though little trouble was experienced 
with Ss leaving one key down for several 
trials, it seems plausible that a different 
type of key system would give different 
results for this measure. 

Other proposed models.—Groups .15 
and .30 replicate the experiment of Estes 
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and Straughan (5), and similar results 
were obtained for the mean learning 
curves. The between-groups differences 
in mean and trend of a values which were 
found here arise largely from the last 200 
trials, and the values for the earlier trials 
agree with the data from their shorter 
sequences. Hence, the results for these 
two groups confirm their experiment ex- 
cept for the later increases in learning 
rate and, possibly, the later behavior of 
the a values. 

With respect to the model proposed 
here for the six experimental groups, the 
finding that the best estimate of ap is 
close to 1 is consistent with the model 
proposed by Neimark who made the a 
priori assumption that ap is equal to 1. 
Atkinson (2) has compared three models 
including the cases where a = 4, and 
a = 1, as well as the case in which the 
stimulus sample is not conditioned to 
either response class when an Eo occurs. 
This latter assumption, which is favored 
by his results, leads to exactly the same 
expressions for mean performance curves 
and a values for the event frequencies 
used here as does the assumption ap = 1. 
Thus, although the mean curves do not 
differentiate, it should be possible further 
to test Atkinson’s model against the a 
and 4 values which have been obtained 
here. 

Statistical learning theory.—Although 
we conclude that the model is inadequate 
to meet the demands which we have im- 
posed, this is not meant as an indict- 
ment of the underlying theory. The 
model has given a fair rationalization of 
the mean performance curves, and has 
failed principally in accounting for re- 
sponse distributions and _ conditional 
probabilities which are not even obtain- 
able from other existing theories. The 
basic notions of statistical learning the- 
ory are concerned with a rigorization of 


the notions of stimulus and response, 


permitting the possibility of precise pre- 
diction. The partial success of the pres- 
ent model indicates more that the be- 
havior under study is not as simple as the 
model than that the 
theory is incorrect. 


assumed basic 
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SUMMARY 


This paper reports a model for two-choice 
predicting behavior, and three experiments which 
evaluate the model. The is based on 
statistical learning theory, being an extension of 
the Estes-Burke-Straughan model to the case 
where either response alone (single events), both 
responses, or neither response (double events) 
may be correct in any trial. Sharing parameters 
are introduced to represent the proportion of 
stimulus elements conditioned to the two re 
sponse classes on each type of trial. 

In each experiment, Ss were required to 
predict which one of two lights would flash next 
by choosing on each trial exactly one of the two 
possible responses. 

The results of Exp. 1 were consistent with the 
model but raised questions as to its limits of 
applicability. 

Experiment 2 tested effect of four 
instructions and three trial rates. Instructions 
had no discernible effect. ‘Trial rate had a weak 
initial effect when the light frequencies were 90% 
vs. 50%, and a pronounced effect when the 
frequencies were 75% vs. 50%. Asymptotic 
response rates tended to lie at or below the 
minimum. 
sponses about the asymptotic mean departed 
widely from those predicted by the model. 

Experiment 3 was designed primarily to test 
the model. 
events as in the experiment of Estes and Strau 
ghan. 
two groups generally substantiated their results. 
However, trend analysis of conditional prob- 
abilities of response indicated a possible dis- 


model 


sets of 


theoretical Distributions of re 


Two groups received only single 


The theoretical learning curves for these 


crepancy from their model, 

Six additional groups formed a 3 X 2 design 
with both 
responses correct) and probability (.30, .45, .60) 
of double event as factors. Analysis of variance 
showed that only probability of double event had 
a significant effect on the mean learning curves. 
The model gave fair, but not perfect, predictions 
of terminal response rates and, when an increase 


type (neither response correct, 


in learning rate later in the trial sequence was 
taken into account, of the entire learning curves 
over Trials 101-400. 

Two differences of conditional probabilities of 
response, one a measure of stimulus control over 
S, the other a measure of perseverative tendency 
after a event, 
Trend analysis of the former showed significant 
between-groups trends. Inspection of the latter 
showed little effect of group treatments or trials 
Although, within the model, both differences 
give estimates of the learning rate, both were 
considerably larger than learning rates obtained 
from the learning curves. Nor did they agree 


double were also computed 
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well with each other, either in magnitude or in 
change with trials 

It is concluded that the model should be 
useful in predicting mean performance over a 
fairly wide range of stimulus conditions but that 
it is inadequate to handle response distributions 
or conditional probabilities in its present form. 
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REWARD AND PUNISHMENT IN A MINIMAL 
SOCIAL SITUATION ! 


JOSEPH B. SIDOWSKI 
San Diego State College 


The study of conditioning in the 
laboratory, and the development of 
conditioning theory, are justified in 
part by the hope that the principles 
derived will ultimately enhance the 
understanding of human behavior in 
complex situations including social 
ones. In the present study an at- 
tempt is made to re-evaluate the es- 
sential features of a social situation 
viewed entirely within the framework 
of conditioning theory, and to investi- 
gate the simplest situation that could 
be considered truly social within this 
framework. Analysis of social be- 
havior in these terms implies that no 
assumption whatever is to be made 
regarding such concepts as attitude, 
understanding, etc. If such factors 


enter the picture at all, they are to be 
considered as manifestations of com- 


plex habits. Preferably, they will 
not be enountered at all in the mini- 
mal social situation, and in any event 
will not be part of the definition of 
social behavior. ; 

In this analysis it will be assumed 
that the main factors controlling 
social behavior are reward and punish- 
ment. Within this framework, the 
essential features of a social situation 
are: (a) Two or more Ss have at their 
disposal responses which result in re- 


' Based upon a thesis submitted in partial 
fulfillment of the requirements for the degree 
of Doctor of Philosophy at the University of 
Wisconsin. The author wishes to express his 
gratitude to his adviser, Dr. D. A. Grant, and 
to Drs. L. B. Wyckoff and R. Miles for assistance 
in the preparation of this paper. ‘The author 
also wishes to acknowledge the aid of the 
Numerical Analysis Laboratory of the University 
of Wisconsin. 


warding or punishing effects on other 
Ss. (b) The principal sources of re- 
ward and punishment for any S de- 
pend on responses made by other Ss. 
(c) The responses controlling reward 
and punishment are subject to learn- 
ing. It will be assumed that, what- 
ever else may be involved, at least the 
above features are present in any 
social situation. 

It is of interest to determine what 
learning would be predicted on the 
basis of these conditions alone, as- 
suming only that the principles of 
conditioning theory are operating. 
This was the purpose of the present 
experiment. 


Two Ss isolated from each other were pro- 
vided with push buttons by means of which each 
could give the other reward or punishment. In 
this situation, if S makes a response, he does not 
“suffer the consequences” directly. An S can 
change the rewards and punishments he receives 
only to the extent that this action may change 
the behavior of the other S. Thus, if 4 makes a 
response which results in a reward for B, B may 
be expected to continue with, or to repeat, his 
immediately preceding behavior; if 4 makes a 
response which results in a punishment for B, 
B may be expected to discontinue his immedi- 
ately preceding activity. Such changes in B’s 
behavior would have a direct effect on the 
reward and punishment being received by 4. 
In this way, 4 would receive some consequences 
from his own responses, but in an indirect way. 


It is not intended that this situation 
be directly comparable to familiar 
everyday encounters. People gen- 
erally enter social situations already 
equipped with a massive amount of 
prior learning relating to the effects 
of their behavior on others. This 
makes rigorous control of a given 
social experiment next to impossible. 
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However, much of social control of 
other people’s behavior is through 
stylized ways of indicating approval 
and disapproval that are in effect 
methods of rewarding and punishing 
others. With adults, most of the re- 
warding and punishing techniques are 
conventional, but in many situations 
the responses learned are no longer 
appropriate. New responses must be 
learned. The process through which 
this learning and relearning takes 
place will be central to any extension 
of conditioning theory into the realm 
of social behavior. One may ask, Is 
face-to-face contact and “‘understand- 
ing” of the relationship between Ss 
necessary for “social” learning? The 
question remains as to whether learn- 
ing will occur simply as a result of the 
operation of reward and punishment. 
Also, does it facilitate or hinder learn- 
ing for the S to know that there is 
another person in the situation? 

The first step in a program designed 
to answer these questions is an investi- 
gation of the course of learning in the 
minimal social situation. The Ss were 
divided into two major groups, an 
Informed and an Uninformed group. 
All Ss in the Informed group were 
told that another S was serving in 
the experiment. In the Uninformed 
group, the effects of previous social 
learning were minimized since Ss were 
not told (nor did they guess) that 
they were in a social situation at all. 
The two major groups were sub- 
divided into three Reward-Punish- 
ment subgroups in which: (a) Ss 
could give each other Shock Only, (6) 
Ss could give each other Score Only, 
and (c) Ss could give each other both 
Shocks and Scores. 


MetTuop 


Subjects—The Ss were 180 members of 
elementary psychology courses at the University 
of Wisconsin who received class points for 
participating. 
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Apparatus—The two experimental rooms 
were each equipped with a control panel which 
provided two push buttons, an electrically 
operated counter, and a pair of electrodes. 
Thus, pressing the right-hand button in Room | 
registered one point on the counter in Room 2, 
while pressing the left-hand button in Room 1 
delivered an electric shock to the electrodes in 
Room 2. In Room 2 the right and left buttons 
controlled the shock and counter, respectively, 
in Room 1. A white light served as a signal that 
the experiment had begun. It remained on 
until the session ended. The &’s control board 
contained counters which indicated the number 
of times that each of the four buttons had been 
pressed. In addition to the counters, an Ester- 
line Angus recorder was used to give continuous 
time record of each response, and of the four 
possible combinations of pairs of consecutive 
responses. For example, one pen of the recorder 
operated when a left button response by 4 was 
followed by a left button response by B; another 
pen recorded when a left button response by 4 
was followed by a right button response by B, 
ete. 

The shock was delivered through the plate 
circuit of a 6SK7 tube so that very high resist- 
ance was in series with S. This minimized 
effects of changes in S’s resistance. Separate 
controls were available for each of the two Ss. 
Shock duration was regulated by a relay con- 
trolled timing circuit which permitted a constant 
shock interval for each S. 

Design and procedure.—A 2X3 factorial 
design was used in the experiment. ‘The 180 Ss, 
90 pairs, were randomly assigned to six groups 
consisting of 15 pairs each, with the limitation 
that each group consisted of 5 pairs of males, 
5 pairs of females, and 5 mixed pairs. All of the 
Uninformed conditions were run first and all 
Informed conditions were run last to minimize 
accidental information to S. Within each of 
these conditions Ss were assigned at random to 
the subgroups. 

In the Uninformed condition, Ss were not told 
that another S was serving in the experiment. 
In the Informed condition, Ss were told that 
another S was serving in the experiment and 
that their own scores and shocks were controlled 
by the other S. Within the Uninformed and 
Informed conditions there were three sub- 
conditions. Under the Shock Only condition, 
the score circuit was inoperative but the shock 
circuit remained in operation. Subjects under 
the Shock Only condition, therefore, received no 
scores.. Under the Score Only condition, the 
shock circuit was inoperative so that Ss could 
receive scores only. Under the Shock and 
Score condition, Ss were able to receive and 
administer both shocks and scores. 
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The amount of shock which S received was a 
function of his own threshold. A_ threshold 
circuit was included in the apparatus in order to 
test the threshold of each S before entering the 
experimental rooms. Preliminary tests indi- 
cated that the absolute threshold was more 
reliable and clearly defined than a “pain”’ thresh- 
old, so this measure was used as a basis of shock 
levels. of each S’s absolute 
threshold were taken and the mean of his three 
readings was used as the threshold for each S. 
An analysis of variance was carried out on the 
thresholds of the six groups and no significant 
difference was found. A previous study (4) 
showed that strong shock was more conducive to 
learning in the minimal social situation than 
weak shock. ‘Therefore, three times the absolute 
threshold in milliamperes was used as the shock 
level in this experiment. 


Three readings 


Any questions con- 
cerning the current were ignored; however, each 
S was offered the opportunity to withdraw from 
the experiment before the session began. One S 
withdrew. 

After the threshold readings were completed, 
S was taken to one of the two experimental 
rooms where he was seated before the S control 
board, Electrodes were attached to the first 
two fingers of his left hand. Instructions were 
then read to S after which he was told to relax 
for a few minutes while the apparatus warmed 
up. During this time, E brought the second S 
before the £ panel and went through the same 
procedure, ‘The second S was taken to Room 2 
and seated before the second control board. 
Instructions were then read to the second S. 


All Ss were given the following instructions: 


“When the white light goes on the experiment 
begins. The experiment is in progress as long 
as the white light stayson. You can press either 
of these buttons in any manner that you wish, 
and as frequently as you wish. Do not attempt 
to press both buttons at the same time and use 
only your right hand for pressing. You are to 
keep your left hand on the table. Do not holda 
button down, but just press and release it. The 
object of the experiment is to make as many 
points as you can. Your point score will appear 
on this counter, and the counter will turn each 
time that you score a point. When the white 
light goes off, it is a signal that the experiment 
has ended. Remain seated until £ comes into 
the room, Remember, you are to try to make 
as high a score as you can in any manner that 
you can,” 


In addition to the above instructions, the 
Informed groups were told the following: 


“There is another S in another room who 
controls the number of shocks and the number of 


scores which you will receive. You in turn 
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control the number of shocks and scores which 
the other S receives.” 


Under no condition was any mention made by 
E of the function of the buttons or of the 
electrodes. Questions concerning anything other 
than the clarity of the instructions were ignored. 
When S asked about the instructions, the in- 
structions were merely repeated. 

At the conclusion of the experimental session, 
a series of questions was asked in order to 
determine whether or not any Ss in the Un- 
informed groups were aware of the presence of 
another S in the experiment. Subjects in all 
groups were questioned concerning the function 
of the buttons, electrodes, and experiment. 
Responses to the questions at the end of the 
experiment indicated that none of the Ss in 
Uninformed groups was aware that a second S 
was involved in the experiment. 

The experimental session lasted for 25 min. 


RESULTS 


In the present study, the dependent 
variable was defined as the number of 
button-pressing responses (Shock and 
Score) made by each S. The per- 
formance of each S was recorded in 
terms of the number of ‘Shock and 
Score responses per minute. 

In order to clarify the analysis, the 
results of the experiment are presented 
in two sections. (a) In. the initial 
analysis, Score and Shock responses 
are reported over the entire 25-min. 
session in blocks of 5 min: each. (b) 
A finer analysis is presented on the 
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Fic. 1. Mean number of Score button re- 
sponses per minute over blocks of 5 min. each 
for the 25-min session. 
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Fic, 2. Mean number of Shock button re- 
sponses per minute over blocks of 5 min. each 
for the 25-min. session. 


single minutes of the first 5-min. 
period and data are presented on some 
of the conditional dependencies in the 
sequences of responses in order to aid 
in the interpretation of the results. 
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Results of 25-min. analysis.—The 
data over the 25-min. session are 
plotted in Fig. 1 and 2. Figure 1 
shows the mean number of Score re- 
sponses per minute for each of the six 
experimental groups while Fig. 2 
shows the mean number of Shock re- 
sponses per minute. These two types 
of response were analyzed by means 
of Grant’s extension of the Alexander 
trend test (2). 

Inspection of the curves of Fig. 1 
indicates that the mean number of 
Score responses increases over the 
25-min. period for all except the two 
Shock Only groups. Further, each 
Uninformed group seems to do better 
than the corresponding Informed 
group. 

More precise information on the 


TABLE 1 
Summary Awnatysis or Trenps ror Scores Over 25-Min. Perion 


Source of Variation 





MS Error Term F 





A. Over-all trend 
1. Linear 
2. Quadratic 
3. Cubic 
4. Quartic 


B. Between-group means 
a. Information 
b. Reward-Punishment 


c. Inf. X R-P 


C. Between-group trends 

1. Linear 
a. Information 
b. Reward-Punishment 
c. Inf. K R-P 

2. Quadratic 
a. Information 
b. Reward-Punishment 
c. Inf. x R-P 


D. Between-ind. means 


E. Between-ind. trends 
1. Linear 
2. Quadratic 
3. Cubic 
4. Quartic 


F. Total 





*P = OS. 
-P = Ol. 


679,683 37.89°* 

2,179,315 56,37%* 

491.794 32.96%* 
29,371 2.91 
18,252 2.26 


2,087,920 8.47°* 
586,143 2.38 
4,737,484 19.23%* 
189,244 77 


$0,958 
145,627 
17,710 
354,162 
1/051 
45,346 
3,269 
94,450 
17,281 


2.84°* 
S.a7° 


AO 
9. 16** 


RRR =e ee 


246,388 13.73°* 
17,940 
38,661 
14,921 
10,107 
8,072 
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rate of Score responding is given by 
the trend shown in Table 1. The 
existence of learning is shown by the 
significance of the linear and quadratic 
components of the over-all trend. 
The significance of the linear and 
quadratic components of between- 
groups trend shows that the learning is 
not the same for all groups. ‘There 
were no group differences between 
cubic and quartic components which 
have omitted from Table 1. 
A further breakdown into the factors 
of the orthogonal design shows that 
this difference in trends is associated 
with the three Reward and Punish- 
ment conditions. A_ similar state- 
ment holds for the analysis of the 
group means: significant differences 
arise only from the Reward and Pun- 
ishment conditions. The apparent 
superiority of the Uninformed groups 
is not substantiated. Finally, the 
siynificant differences between indi- 
vidual means show merely that there 
are reliable individual differences. 
Since 


been 


significant differences be- 
tween group means were found, Dun- 
can’s test (1) for differences between 
ranked means was carried out on the 
Score data. The differences between 
the two Shock Only conditions and all 
other conditions were significant at the 
Ol level. No significant difference 
was found between the Shock Only- 
Informed and the Shock Only-Unin- 
formed groups, nor were the remaining 
four score conditions significantly 
different. ‘The reward (Score), there- 


fore, seems superior to punishment 
(Shock). 


A trend analysis (2) was also used 


to test the Shock response data. Only 
the quadratic component of the over- 
all trend was significant. This sug- 
gests the presence of curvilinearity or 
“bowing” in the rate of responding to 
the Shock button as a function of 


time. A lack of significance between 
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group means and trends indicated that 
all differences in the behavior of the 
various groups were characterized by 
their rate of responding to the Score 
button. 

A repeated-measures analysis of 
variance was carried out over the last 
5-min. period of the experiment using 
the Shock button score and Score 
button responses as the two measures 
for each S. The analysis of variance 
showed significant differences (.01 
level) between groups, which arose 
from the influence of the’ Reward- 
Punishment condition. A significant 
Button effect reflected the greater 
number of responses made to the 
Score button. Finally, a significant 
Groups by Button interaction showed 
that the difference in rate of respond- 
ing to the two buttons was different 
for the various groups. The applica- 
tion of Duncan’s test (1) showed that 
the two Shock Only groups gave sig- 
nificantly fewer score responses than 
the remaining four groups, but no 
further differentiation of the groups 
was found. The same analysis, when 
applied to the Shock responses, showed 
no significant variation between the 
six groups. 

Results of the analysis of first 5 min. 
and of sequences of responses.—Analy- 
sis of the data for single minutes of 
the first 5-min. period showed that 
learning did take place during this 
time. ‘Trend analyses of the score 
responses showed significant between- 
group trends (linear component) and 
significant between-group means with 
the differences being due to differences 
between Reward-Punishment condi- 
Analysis of the Shock button 
responses showed a significant quartic 
component suggesting the presence of 
reliable tempora! fluctuations or oscil- 
lations in rate of responding over this 
initial period. 


tions. 


On a priori grounds it was thought 
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that learning would occur according 
to the following principles: (a) If Sa 
presses the Score button, Sg will tend 
to keep pressing whichever button he 
pressed previously; (b) If Sq presses 
the Shock button, Sg will tend to 
change buttons. It. is easily seen 
that, singly or in combination, Princi- 
ples a and b will tend to lead to Score 
responses for both Ss. Hence, there 
should be an increase in Score-Score 
responses, a decrease in Shock-Score 
and Shock-Shock, and possibly an 
initial increase in Score-Shock followed 
by a decrease. To see if this hap- 
pened, sequences of responses by 
pairs of Ss were recorded and plotted 
for each of the six groups over the 
entire 25-min. session and also over the 
first 5-min. interval. 

Figure 3 shows the mean number of 
sequential responses per minute for 
the Shock Only-Informed group over 
the 25-min. session. It is representa- 
tive of the types of sequential curves 
found for the Shock Only conditions 
over the entire session and over the 
first 5 min. Figure 4 the 
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shows 


Score-Informed group, and is repre- 


sentative of the types of curves found 


for the Shock and Score, and the 
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Fic. 3. Mean number sequential responses 
per minute for the Shock Only-Informed group 
blocks of 5 min. each for the 25-min. 
session. 
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Fic. 4. Mean number sequential responses 
per minute for the Shock and Score-Informed 
group over blocks of 5 min. each for the 25-min. 
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Score Only conditions over the 25-min. 
session. Figure 5 shows the sequential 
responses for the Shock and Scores- 
Informed group over the first 5 min. 
It is representative of the types of 
curves found for the Shock and Score, 
and the Score Only-Uninformed groups 
during the first 5-min. The se- 
quences which are presented are the 
+ + (Score response by 4 followed 
by Score response by B), — (Shock 
response by 4 followed by Shock re- 
sponse by B), + 
followed by Shock and 
- + (Shock response followed by 


Score The se- 


(Score response 
response), 


respt mse) seq uences, 
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Fic. 5. Mean number sequential responses 
per minute for the Shock and Score-Informec 
group over successive 
first 5 min. 


l-min. intervals of the 
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quence curves were included in order 
to aid in the interpretation of the data 
and were not subjected to statistical 
test. ‘These curves may be described 
as follows. For those groups which 
show no learning (Shock Only groups 
represented by curves in Fig. 3), all 
four sequence curves maintain roughly 
the same constant level throughout 
the course of the experiment. For 
those groups which show learning 
(Score Only and the Shock and Score 
groups represented by curves in Fig. 
4 and 5), the + + sequence curve 
shows large increases, while the other 
three sequence curves hover at about 
the same constant level, with possible 
slight decreases toward the end of the 
experiment. Therefore, Principle a 
seems to hold; however, Principle b is 
dubious. 


Discussion 


In setting up the design of the present 
experiment, two primary questions were 
raised: (a4) Would learning take place in 


such a minimal social situation? (4) 
What would be the effect of Reward- 
Punishment variables and the factor 
of Information (Informed, Uninformed) 
upon results? 

First, the results indicate clear evidence 
of learning, i.e., Score button responses 
increased even in the first 5 min. of the 
session. The Shock button responses, 
however, did not seem to noticeably de- 
crease or increase during the entire ses- 
sion. Second, reward (Score) seemed to 
be more effective than punishment alone, 
and a combination of reward and punish- 
ment was highly effective. Third, social 
information added nothing. In fact, 
such differences as were seen were nega- 
tive. 

Theoretical interpretation is difficult in 
the present experiment. Detailed ex- 
amination of the results suggests a num- 
ber of possible interpretations and rules 
out others. First of all there are two 
possible types of learning that might 
occur: (a) Each S may simply change his 
over-all rate of responding under the in- 
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fluence of the scores and shocks received 
from the other S, but without favoring 
either one of his two responses over the 
other, or (6) Ss may develop differential 
response rates to the two buttons. While 
it is possible that learning of the first 
type would be largely independent of the 
actions of each S’s partner, it is clear that 
learning of the second type cannot occur 
without an interaction between Ss in 
each pair which yields differential con- 
sequences to the two responses. The 
second type of learning was the only one 
found as shown by the failure of the 
Shock responses to participate in the 
upward trend of the scores. Therefore, 
the matter of primary concern is the 
second type of learning. 

Two groups appeared to have little 
change in behavior or learning. These 
were the Shock Only groups. It was ex- 
pected that punishment (Shock) alone 
would have more of an effect than is 
shown in the present study. Since the 
rate of résponse in the Shock Only condi- 
tions remain relatively stable, rather 
than decreasing in time, one might ex- 
pect a tendency for Ss in these groups to 
learn to favor the non-shock button since 
a sequence of responses on these buttons 
would eliminate the punishment. In- 
spection of the curves, however, shows 
that this did not occur. One possible 
explanation of the fact that Ss did not 
learn to press the non-shock button sig- 
nificantly more often than the shock 
button is that the over-all response rate 
is low for the Shock Only groups, and per- 
haps a minimal rate of responding is 
necessary for learning to occur in this 
situation. A second possible explana- 
tion is that receiving shock and not 
receiving scores both act as negative 
reinforcers. A sequence of shock-button 
responses by the pair of Ss, therefore, 
may act as a negative reinforcer because 
of its punishing effect. A sequence of 
non-shock button responses may also act 
as a negative reinforcer because of the 
fact that no score is forthcoming. The 
final effect is a counterbalancing of nega- 
tive reinforcers leading to continual 
oscillation between the buttons. 

Now remains the critical problem of 
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how learning took place in the other four 
groups. It is necessary to account for a 
rise in the scores and for the fact that 
the shock responses remain essentially 
unchanged. First, the social informa- 
tion factor has been shown to be unim- 
portant in this experiment. Verbal re- 
ports of Ss in the Uninformed groups 
indicated that none of them were aware 
of the fact that another S was serving in 
the experiment. In fact, most Ss under 
this condition stated that they thought 
they were working against a set pattern 
which was being reproduced by an elec- 
tronic circuit or that they were working 
against a machine. Many Ss stated 
that they had absolutely no control over 
what scores or shocks they received since 
the apparatus operated automatically. 
These verbal reports clearly give little 
indication of what actually happened or 
how it came about that these Ss in- 
creased their score responses so nicely. 
Some explanations based on effects of 
rewarding and punishing specific re- 
sponses, however, lead to interesting con- 
clusions. For example, if one examines 
some of the possible critical sequences of 
responses the following matrices are ob- 
tained: 


Type I Type I 


Sequence A Sequence 


6 


y | 
4. 
™ 


I 
2 
3 
4 


In the above matrices, the + specifies 
the Score button response while the — 
refers to the Shock response. 4 and B 
refer to the two Ss, i.e., in Sequence 1, 
S, presses a Score button; this is followed 
by a Score button response by Ss which, 
in turn, is followed by a Score response 
by Sa. In the Type I matrix, it would 
be expected that any one of the sequences 
of responses made by Sa, then Ss, then 
S, would result in an increase in that 
particular sequence because S§, will 
tend to repeat responses followed by re- 
ward in that particular sequence. Inthe 


325 


Type II matrix a decrease would be ex- 
pected in that particular sequence be- 
cause S, will tend to change a response 
followed by punishment in that particular 
sequence. If the above assumptions are 
accepted it is found that, on the average, 
they will lead to learning because only 
++ +willgive+ +++ + sequence; 
the others give + — + — — + — 
sequences. At least two additional as 
sumptions, therefore, must be made: (a) 
some “random oscillation,” and (4) a 
dominance or ” characteristic 
of reward; i.c., + + + sequence must 
have a greater effect than any minus 
sequence. This latter 
empirically reasonable because Score 
groups learned better than Shock groups. 
Turning to the actual sequences, there 
is some evidence favoring the above in- 
terpretation. The only sequence that 
increases markedly in the groups that 
learned is the + + sequence. 


“trapping 


assumption is 


SUMMARY 


The present experiment was designed to 
investigate the influence of reward (Score) and 
punishment (Shock) in a minimal social situa 
tion. Two Ss, isolated from each other, were 
provided with two push buttons by means of 
which each could give the other shock and 
scores. 

The Ss, 90 pairs, were divided into two major 
groups, an Informed and an Uninformed group 
All Ss in the Informed group were told that 
another S was serving in the experiment and that 
the other S controlled his Shocks and Scores 
In the Uninformed groups, Ss were not told that 
they were in a social situation. The two major 
groups were subdivided into three Reward 
Punishment subgroups in which Ss could give 
each other Shock Only, Score Only, or Shock and 
Score. 

The results indicate that learning occurred in 
the Score Only, and Shock and Score conditions 
but not in the Shock Only conditions. Learning, 
therefore, was demonstrated in those groups 
where S had an opportunity to receive a reward 
(Score). Learning under these conditions oc 
curred regardless of whether or not S was told 
that another S was serving in the experiment 
There were no significant differences attributable 
to whether or not S knew he was working with 
another S. 





326 


It was concluded that the major factors con- 
trolling the behavior of Ss in the present study 
were the Reward-Punishment variables, that 
awareness of the sociality of the situation had 
little effect, and that reward had a greater effect 
than punishment. 
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EFFECTS OF VARIED DISTANCE ON SHORT-RANGE 
SIZE JUDGMENTS! 


NOEL JENKIN? 


University of New Brunswick 


The “law” of phenomenal regression 
to the real object proposed by Thou- 
less (9) predicts that an S, matching 
a standard object at a far distance 
with a comparison object at a near 
distance, will choose a value inter- 
mediate between the physical size of 
the standard and one which would 
subtend the same visual angle as the 
standard. ‘The generality of this as- 
sumption has been challenged by later 
writers (1, 2, 3, 7, 8) who have clearly 
shown that such a “law” is true only 
for a given attitude and understand- 
ing of the task, i.e., the “‘analytic,” 
“retinal,” or “stimulus” attitude. 
Under the contrasting type of condi- 
tion, where S is set to make an “ob- 
jective” match, i.e., to adjust the vari- 
able to a physical size equaling that 
of the standard, not only is “‘size con- 
stancy”’ generally achieved, but “over- 
constancy” is commonly found. This 
means that in a situation where the 
standard stimulus is far and the vari- 
able comparison stimulus is near, a 
match will be made which in its physi- 
cal dimensions exceeds the value of 
the standard. Furthermore, as 
shown by the work of Gibson (2), 
Gilinsky (3), and Smith (8), this effect, 
within a certain limit, increases with 
distance, so that a receding object of 
constant physical size is progressively 
overestimated 

The investigations which have with 
fair unanimity established this find- 
ing, have mostly used distances of 

1 The author is indebted to Dr. Ray Hyman 
of Harvard University for critically reading the 
first draft of the present manuscript 


2 Now at The Training School at Vineland, 


N. J. 


considerable extent. Gibson's far- 
thest distance was about half a mile, 
Smith’s was 320 ft., and Gilinsky’s 
was 4000 ft. All three investigators 
supplied a visual fieid rich in cues of 
distance. Chalmers (1) attempted 
with apparent success to eliminate 
these secondary cues of distance and 
found progressive overestimation from 
binocular judgments at 15 and 20 ft. 
Overestimation in decreasing amount 
was then maintained up to 80 ft. 

It is evident that overestimation, 
over a large range of distances is a 
well-attested phenomenon, and it, 
seems, as Smith (8) has pointed out, 
that this is not readily accounted for 
in terms of the space-error hypothesis 
of Holway and Boring (5). The 
generality of the phenomenon requires 
testing by the study of judgments at 
distances shorter than 20 ft. 

An investigation by the present 
writer (6) suggested that larger esti- 
mates would be obtained with the 
comparison object at | or 2 ft. than 
with the comparison at 15 ft., when 
the standard object remained constant 
at 30 ft. The purpose of the present 
study was to check this finding under 
properly controlled conditions, and to 
test the hypothesis, also suggested by 
the former investigation, that accu- 
racy of judgment would be greater 
when the comparison was further 
from S than when it was at the 2-ft. 
distance. 

Metnop 

Apparatus.—The apparatus consisted of white 
cardboard squares. The standard card, 4 in 
square, was mounted on a wooden post, 474 in 


above floor level, and was positioned directly in 
front of S, at a distance of 20 ft. from his eyes. 
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The comparison series was a set of squares 
ranging from 2} in. to 5} in. by steps of § in. 
They were individually displayed upon a plat- 
form at the same height as the standard, but 
positioned on S’s left at an angle of 90° with 
respect to the direction of the standard. The 
platform measured 27 X 24 in. and was papered 
in a dull green. A background of similar ma- 
terial, measuring 15 K 24 in. was fastened at the 
rear of the platform. Concealed pins and a 
slight depression, invisible to S, served to main- 
tain the cards in vertical position in the middle of 
the platform. No special background was pro- 
vided for the standard object. A large room, 
well and constantly lighted by daylight and 
containing no interposed objects, provided a 
setting for the apparatus. 

Subjects.—The Ss were 48 students in 
elementary psychology courses at the University 
of New Brunswick Summer School, 24 of each 
sex, who were required to serve in the experiment 
as part of their course work and who received 
credit points for doing so. At the time tested, 
all Ss were naive with regard to size and distance 
theory and experimentation, and all had ap- 
proximately normal vision or vision satisfactorily 
corrected. ‘They were randomly assigned to four 
subgroups of 12 Ss each, with an equal number 
of males and females in each group. 

Design.—Half of the Ss (Group I) first made 
judgments of the standard when the stimuli were 
equidistant at 10 ft. They then made their 
judgments at the disparate distances. With the 
other half of the sample (Group II), this pro- 
cedure was reversed, so that the equidistant 
judgments were made last. Half of the Ss in 
each main group were presented with the com- 
parison first at 10 ft. and then at 2 ft. (Sequence 
A). The other half of each group was treated 
with the converse order (Sequence B). All four 
subgroups therefore made judgments with 
standard and variable displayed at (a) 10 ft. and 
10 ft., (b) 20 ft. and 10 ft., (c) 20 ft. and 2 ft. 
For Group IA the order was a, b, c; for Group IB 
it was a, c, b; for Group IIA it was b, c, a; and 
for Group IIB it was ¢, 4, a. 

Procedure.—The height of each seated S’s eye 
level was adjusted to correspond with that of the 
stimulus objects by providing him with an 
appropriate cushion or cushions. Instructions 
to S emphasized an objective attitude. Each 
card as it was presented on the platform was to 
be judged as larger, equal to, or smaller in 
physical size, with respect to the standard. 
When a card was described as equal, it was to be 
S's opinion that its dimensions would measure 
the same as those of the standard card. From 
each S, two judgments were secured at each 
distance, one from an ascending series and one 
from a descending series. For each subgroup, 
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half of the Ss started with an ascending series and 
half started with a descending series. Prior to 
the first series one judgment was secured in order 
to illustrate the procedure. The card used for 
this practice trial was one outside the range of 
the comparison series proper. For Ss beginning 
with an ascending trial, the practice card was 6 
in. and for those beginning with a descending 
trial, it was 2hin. ‘The S was required to watch 
the standard, while E placed the comparison 
card in position. The S then turned his head 
and observed the latter. He was not prevented 
from looking back and forth if he wished to do so. 
After making his judgment, he was required to 
watch the standard again while E replaced the 
comparison card. After the judgments were 
obtained at the first distance, the stimulus 
objects were quickly moved to the second posi- 
tion. Each card in the comparison series was 
seen by the S no more than twice. A second, 
identical set of cards was used for the third 
judgment. Each series was begun with a 
different sized card. 


RESULTS 


The mean size judgments for each 
subgroup at each distance are shown 
in Table 1. They are expressed in 
raw units in order to facilitate com- 
parison with the physical size of the 
standard being judged, i.e., 4 in. A 
preliminary analysis of variance of all 
judgments at the disparate distances 
showed that Sex Differences and inter- 
actions of Sex with Groups (prior 
versus latter exposure to the equidis- 


TABLE 1 


Mean Size Jupcments or 41. STANDARD 
at Two Distances 


; 
| Group | Group II 


| (10 ft./10 ft. 
| Judgments 
First) 


(10 ft./10 ft. 
Judgments 


Last) | Grand 


Distances 


20 ft./10 ft. 
20 ft./2 ft. 


10 ft./10 ft 4.038 4.031 


* In Sequence A, the comparison stimulus occurred 
first at 10 ft. and then at 2 ft.; in Sequence B the reverse 
was true. 
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TABLE 2 


ANaALysis oF VARIANCE OF Size JUDGMENTS 


Source MS | PF 


| 1496 | 0.79 
0.08 


Groups (G) 


Sequence (S) 157 | 
1891 | 
9184 | 


Subjects 
Distances (D) 
GxD 
SxD 
Gxs 
GxDxs 
Residual j 


25.87°%* 
0.82 
0.26 
0.24 
8.71* 


Note.—-The Groups variable represents prior 
exposure to equidistant condition vs. no prior exposure. 
*P < Ol. 


-P < OO. 


tant condition), with Sequence, and 
with Distances were all nonsignificant 
(P > .05) hence the data from both 
sexes were combined in subsequent 
analysis. Examination of the com- 
bined data for the 20 ft./10 ft. and the 
20 ft./2 ft. distances showed that the 
means tended to be proportionate to 
the SD’s. The variances were there- 
fore stabilized by means of a logarith- 
mic transformation and an analysis 
of variance was performed on these 
measures. ‘Table 2 gives the results 
of this analysis, which strongly sup- 
ports the hypothesis that judgments 
at the 2-ft. distance would be sig- 
nificantly larger than those at the 15- 
ft. distance. It may also be noted 
from Table 2 that there is a negligible 
difference between groups with prior 
versus latter exposure to the equi- 
distant condition. A nonsignificant 
F value is also found for Sequence 
(order of presentation of the two dis- 
parate distances) and for each of the 
simple interactions. A_ significant 
triple interaction indicates that prior 
exposure to the equidistant stimuli 
has differential effects on judgments 
at the disparate distances, depending 
on which distance is experienced first. 
The variance estimate for the triple 
interaction is not so large as to render 
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equivocal the interpretation of a 
difference between means for the two 
distances. It is very clear that the 
distance variable accounts for the 
great majority of variance in the data. 

The equidistant judgments served 
as a control in testing the hypothesis 
that the accuracy of judgment would 
be greater at the 10-ft. than at the 
2-ft. distance. Since the group mak- 
ing equidistant judgments first did not 
differ significantly from that making 
them last (t = .25), the data from the 
10 ft./10 ft. distance were combined. 
A test of the hypothesis that the mean 
of these judgments did not differ sig- 
nificantly from a parent population 
whose true mean was 4.0 in. (the phys- 
ical dimension of the standard) found 
a t of 1.06, P > .25. A similar test 
applied to the judgments made at 20 
ft./10 ft. found at of .15. Therefore 
it was concluded that there was no 
reason to suppose that under either 
of these conditions the sample was 
deviating by more than chance fluctu- 
ation from a value equal to the ob- 
jective, physical size of the standard 
being judged. The same test applied 
to the 20 ft./2 ft. judgments found a t 
of 3.72, P < OO1. At this distance, 
the sample mean differs significantly 
from an assumed parameter of 4 in., 
in the direction of overestimation. 
It would appear to be unlikely that a 
“space error” is responsible for this 
overestimation, in view of the non- 
significant difference from 4 in. of the 
equidistant judgments. 


Discussion 


The distant object remained constant 
throughout the experiment. Since it 
was matched from a comparison series 
both at 2 ft. and 10 ft., it follows that the 
two matching cards were also subjectively 
equal to each other. Actually, the one 
chosen at the near position tended to be 
physically larger than that chosen at the 
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farther position. At first thought it 
might seem that this trend is consistent 
with the geometry of perspective, i.e., 
the object decreasing in size with 
distance. Further reflection, however, 
makes it clear that an object of given 
physical size appears smaller when it is 
close, and therefore a physically larger 
match is required to establish subjective 
equality with the standard. Apparent 
size thus significantly increases over the 
short distance interval from 2 ft. to 10 ft. 

Furthermore, the average judgment at 
the near position exceeded the physical 
size of the standard and at the far posi- 
tion was exceedingly close to the physical 
size of the standard. It is therefore 
clear that in these judgments there was 
no “compromise” between retinal size 
and objective physical size. On the con- 
trary, in fact, at one of the distances, 
“overconstancy” reliably 
strated. 

These findings complement those of 
Chalmers (1), Gibson (2), Gilinsky (3) 
and Smith (8). The increase of appar- 
ent size with distance found by these 
writers under long-range conditions is 
now seen to take place at very short dis- 
tances also. We may therefore offer a 
tentative generalization by saying that 
objective judgments in the size-distance 
situation appear to follow a trend op- 
posite to the direction predicted by the 
rules of geometrical perspective. 

A further possibility is suggested by 
the present demonstration that a stand- 
ard is significantly overestimated when 
the comparison is very close to §. It 
appears probable that in judging the size 
of an object at a given distance by means 
of a comparison series, there may be for 
the latter a point or range of optimum 
location for greatest accuracy. The 
limits of this function, at different dis- 
tances of the standard, is a matter which 
requires further exploration. 


was demon- 


Hitherto, overestimation of size has 
been explained—e.g., by Smith (8)—by 
error in the estimation of distance. If 
distance is overestimated, it is reasoned, 


size will be likewise overestimated. Pro- 
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gressive overestimation of size with in- 
creasing distance (2,3,8) lends some 
weight to this hypothesis. Such a view, 
while not inconsistent with the data of 
the present experiment, provides a less 
satisfactory basis for explaining short- 
range overestimation of size. Hermans 
(4) has shown that a decrease in perceived 
size is coupled with an increase in bino- 
cular convergence. This finding, also 
consistent with the present results, may 
furnish a more useful point of departure 
for further study of the phenomenon. 


SUMMARY 


The present experiment studied the trend in 
size judgments under objective instructions at 
two short viewing distances. Judgments of a 
standard 20 ft. away were obtained with the 
comparison object presented at two distances, 
from four groups of 12 Ss each. ‘Two groups 
were first required to give judgments with the 
stimuli equidistant at 10 ft. The other two 
groups gave judgments at the disparate dis 
tances, and then made equidistant judgments. 
Two groups made judgments with the sequence 
being 20 ft./10 ft. and then 20 ft./2 ft. The 
other two groups had this order reversed. 
Analysis of variance showed that prior experience 
with the equidistant stimuli had no significant 
effect on the subsequent judgments and did not 
simply interact with either of the other variables 
A highly significant difference between the two 
distances, relatively independent of the effects of 
sequence and prior experience, confirmed the 
prediction that the nearer object would yield the 
larger match, and also that would 
deteriorate from the far to the near position of 
the comparison. 


accuracy 
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The study of possible relationships 
between emotional factors, value 
structures, and personality ‘“‘needs,”’ 
and various parameters of the percep- 
tual process has enjoyed some popu- 
larity in recent years. An aspect of 
perception which has received much 
attention in these studies is that of 
size estimation, stimulated largely by 
Bruner and Goodman’s (1) study in 
which size estimations of coins were 
found to be related to the economic 
background of the children making 
the judgments. That factors other 
than the value of the coins may play 
an important role in producing these 
results is suggested by an experiment 
by Bruner and Rodrigues (3), who 
varied the material of coin-like stim- 
uli, and Bruner and Postman (2), who 
varied the designs on the faces of the 
coin-like stimuli. Evidence from 
these studies suggests that variations 
in these structural properties of the 
stimuli also contribute to differences 
in perception, either in addition to or 
regardless of the associational value 
of the stimuli. But Bruner and Post- 
man’s study is ambiguous in this 
regard, since the designs they used 
were representational of a dollar sign, 
a swastika, and a neutral symbol. 
Thus associational value was con- 
founded with structural (design) dif- 
ferences of the stimuli. 

A study by Singer (4) was the im- 
mediate stimulus for the present 
study. Singer found that by slightly 


' This research was carried out while the jun- 
ior author was a U. S. Public Health Service 
post-doctoral fellow at the University of 
Wisconsin. 


Montana State University' 


changing a circular stimulus design so 
that it could schematically represent a - 
smiling and/or a frowning face, he 
could produce greater size estimations 
than when the stimulus circle con- 
tained a nonsensical design, or was 
blank. Singer found no differences 
between size estimations of the frown- 
ing and the happy faces. He con- 
cluded that meaningfulness (or asso- 
ciation value, as it will be referred to 
hereinafter) is a significant variable in 
size perception. However, it should 
be noted that in Singer’s experiment, 
association value and structural prop- 
erties of the stimuli are not indepen- 
dent of each other. One might inter- 
pret his results as simply indicating 
that two of his configurations of lines, 
those which he called happy and 
frowning faces, are perceived as being 
larger than the other stimuli as a 
result of the manner in which the lines 
of the design curve, creating percep- 
tual phenomena not unlike the Miller- 
Lyer illusion, etc., and that these 
larger size estimations are completely 
independent of the associational value 
of the stimuli. 

The present experiment was de- 
signed to test the same major hy- 
pothesis as did Singer—namely, that 
there are differences in the size estima- 
tions of visual stimuli as a function 
of the association value of the stimuli 
and of the pleasant or unpleasant 
“tone” of the associations. However, 
an attempt has been made in this 
experiment to vary this association 
value independently of the stimulus 
configuration by associating pleasant 
concepts, unpleasant concepts, and 
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nonsense syllables to the same stand- 
ard stimuli. The results of such an 
experiment might also shed more light 
on the question of whether the differ- 
ences in size estimations of the various 
stimuli studied in the experiments 
reviewed above were due to autoch- 
thonous qualities of the perceptual 
process itself, or may be ascribed to 
other functional components of be- 
havior which are called into play after 
the perceptual act has been com- 
mitted, such as those involved in 
reproduction tasks. The present ex- 
periment has been designed to 
eliminate or control these latter vari- 
ables to a great extent, through the 
use of covariance analysis in which 
general response dispositions in mak- 
ing size estimations are set aside. 

It was also hypothesized that there 
would be an interaction between as- 
sociation value and configurational 
aspects of the stimulus figures. Spe- 
cifically, it was expected that figures 
of curved lines should 


consisting 
heighten any effects produced by 
pleasant association value, and that 
angular figures should intensify the 


effects of unpleasant associations. 
These notions are derived from as- 
sumptions commonly made by artists 
and architects regarding the effects on 
the viewer of the curved or angular 
characteristics of shapes and lines. 


MetTuop 


The S’s were 40 undergraduate students at 
the University of Wisconsin. Each S was asked 
to select the five words from the Kent-Rosanoff 
word-association list which had the most pleas- 
ant meanings for him, and five words which were 
the most unpleasant. Following this, S was 
asked to make five practice control settings of 
the size estimation apparatus, using a 5 X 5-in. 
blank white card on a black ground as the 
stimulus. The size estimation apparatus con- 
sisted of a large black L-shaped board mounted 
at desk height before the seated S. Size estima- 
tions were made by S by decreasing the size of a 
white square directly behind the L-shaped board 
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with its face contiguous with the back surface 
of the board. This was done by having S release 
a positioning knob attached to the white square, 
which allowed the square to slide downward at 
a 45° angle until the portion of the square still 
visible within the angle was of the desired size, 
at which point S could turn the knob to hold the 
sliding square in place. The S was also per- 
mitted to move the square up, if he felt he had let 
the slide down too far. The E could then read 
the size of the setting from a centimeter scale 
placed on the back of the sliding white square. 
Settings could range from 0 to 30 cm. The S 
was told to “make the size of the slide smaller 
until it seems to be the same size as the square 
hanging there (the stimulus).” The £ changed 
the stimulus square for each practice setting, 
but the five stimuli were in actuality identical in 
size, color, and position. 

Following these practice, control settings, Ss 
were randomly assigned to the eight experimental 
groups: Curved Pleasant Association, Curved 
Unpleasant Association, Curved Nonsense Asso- 
ciation, and Curved No Associatior’; and Angu- 
lar Pleasant Association, Angular Unpleasant 
Association, Angular Nonsense Association, and 
Angular No Association. Each S in the Pleasant 
Association groups learned to associate one of the 
Kent-Rosanoff words which he had selected as 
pleasant to each of the five stimulus figures. 
The Ss in the Unpleasant Association groups 
learned to associate the words they had selected 
as unpleasant to the stimulus figures. The 
Nonsense Association groups learned to associate 
five nonsense syllables to the five stimuli, and 
the No Association groups were simply shown 
the stimulus figures without associations. The 
method of learning the associations in the various 
association groups was to present each stimulus 
design to S, with its associated word clipped to 
the back of the stimulus card. After the first 
trial, in which each card was shown for 3 sec. 
and then turned over so that its associated 
concept or nonsense syllable was seen by S, S was 
asked to identify the concept or nonsense 
syllable associated with each card as the figure 
was presented. Each stimulus card was shown 
for 3 sec. and, if a correct response was not 
given in that time, then turned over to reveai the 
associated word or syllable to S. The stimulus 
cards with the designs were shown in random 
order for each trial. The Ss learned to a 
criterion of errorless anticipations in three con- 
secutive trials of the set of five designs. The Ss 
in the No Association group were shown the five 
designs for 3 sec. each, for a total of 15 times, 
which compares with a mean number of trials 
in the other three groups of 11.4. 

The stimuli used for size estimations consisted 
of five basic designs derived from the Bender- 
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Gestalt test, and modified so that each design 
consisted only of straight lines. Another set of 
stimuli were then prepared in which the general 
configurations of the first set were maintained 
except that the designs for the second set con- 
sisted entirely of curved lines. For example, a 
square in the angular set was reproduced as a 
circle for the curved-line set. The designs were 
drawn in black ink on white 5 & 5 in. cards, and 
covered an area of about 3 & 3 in. on the cards. 
The stimuli used for the associative learning 
trials were 34 X 34-in. reproductions of these 
cards 

The stimuli for the size estimation trials were 
hung 
20 ft. from S, in a darkened hallway. 
were tested at 


on a black background at a distance of 
All Ss 
about the same hours in the 
evenings in an effort to hold visual cues constant. 

The consisted of 25 
settings of the size estimation apparatus by each 
S, the 25 settings being divided into five massed 
trials, each of which consisted of a different order 
of presentation of the five designs. 


experimental trials 


RersuLts 


ach S’s scores for the data analyses 
were the sums of his size estimation 
settings in each block of five trials. 
Thus represented five 
settings. An over-all analysis of co- 
variance of these data indicates that 
no significant differences among asso- 
ciation groups, or between types of 
designs, were found. 
possibility that any 
ciation value on size estimation might 
be subject to extinction as a function 
of amount of exposure to the stimuli, 


each score 


Because of the 
effects of asso- 


the data were analyzed using only the 


TABLE 1 


ANALYSIS OF VARIANCE OF ERRORS OF 
Estimate, Aut Conprrions, First 
Five ‘Trias (N = 40) 


Source 


Among Association 
Conditions 

Angular-Curved 

Interaction 

Within Samples 


* Significant at .05 level. 
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SUM OF 5 ESTIMATIONS IN CMS 





i 
wo ASSOC 








UNPLEASANT = PLEASANT NONSENSE 


ASSOCIATION VALUE 


Fic. 1. 


Mean sums of first five size estimations. 


first five experimental settings by each 
S (one setting for each design). In 
this analysis, a significant interaction 
was found. Table 1 this 
analysis. 

Figure 1 presents the 
means of the sums of each 
five judgments. The adjusted means 
of the curved and angular design 
groups are similar only in the No 
Association condition. 

As a more specific test of the hy- 
pothesis that there is an interaction 
between the hedonic “tone” of the 
association value of the stimuli and 
the angularity or curvilinearity of the 
stimulus configurations, two more Ss 
were added to each of the Pleasant 
Association groups and to each of the 
Unpleasant Association groups, and 
the data of these four 
analyzed, excluding the 
Association and No 
groups. The results of an analysis of 
covariance on all the settings made by 
these Pleasant and Unpleasant Asso- 
ciation groups (N = 28) did not 
reveal the hypothesized interaction. 
The results of an analysis of the first 
five settings only of Ss in these four 
groups also failed to produce a 
significant interaction. ‘These results 
indicate that the significant inter- 
action reported in Table 2 cannot be 
accounted for by the interaction of 
curvilinearity-angularity with only 


report Ss 


adjusted 
S’s first 


groups re- 
Nonsense 
Association 
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Pleasant Association and Unpleasant 
Association. 


Discussion 


The results of this experiment indicate 
that association value per se does not 


influence size estimations. Only in inter- 
action with configurational characteris 
tics of the stimuli do associations affect 
the perception of size. Further, the re- 
sults indicate that these effects are quite 
unstable, disappearing quite readily with 
increased exposure to the stimuli. 

The interaction of 
pleasant Association 
and Curvilinearity 
direction. 
not statistically significant alone; in this 
experiment, it was also dependent on the 
interaction with 


Pleasant and Un 
with Angularity 
is in the predicted 
However, this interaction is 


Association. 
The results suggest that there is some 
validity for identifying angularity with 
unpleasantness, and curvilinearity with 
pleasantness; the interaction produces 
an illusion of larger size. 


Nonsense 


However, the 
finding that Nonsense Association inter 
acts with the stimulus variables studied 
in this experiment in the same manner 
as does Pleasant Association is not readily 
explainable. There is the possibility 
that the learning of the associations pro 
vided a rewarding, or pleasant, experi- 
ence for the subjects in the Nonsense 
and Pleasant Association conditions, 
whereas the learning experience in the 
Unpleasant Association condition served 
to emphasize the unpleasantness. This 
might account for the similarity of re 
sponse to Pleasant and Nonsense Associ 
ations. 


SUMMARY 


An experiment was designed to test the 


hypothesis that the association value and the 
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pleasantness or unpleasantness of the association 
to stimuli affects the perception of the size of the 
stimuli 


In addition, it was hypothesized that 


curvilinearity or angularity, as stimulus con 
figurational variables, interact with association 
value such that curved figures would enhance 
the effects of pleasant associations and angular 
figures would enhance the effects of un} leasant 
associations. Forty undergraduate Ss each 
made five size estimations by varying a stimulus 
until it matched the standard to provide measure 
on a matching variable. Concepts selected by 
each S as pleasant or unpleasatit were then 
equated with either curved-line or angular-line 
designs which served as the stimuli \ third 
group had nonsense syllables associated with the 
figures, and a fourth group had no associations 
The experimental trials consisted 
of the variable stimulus to 
The 


differences 


of matchings 
each of five design 


stimuli results indicate no significant 


among the groups in their size 
estimations of pleasant, un} lea ant, nonsense, or 
no association figures, or of curved or angular 
figures \ significant interaction between 
curved angular higures and the association vari 
ables was found only for Ss’ first experimental 
judgments of each stimulus. The results are 
interpreted to indicate that association value has 
an unstable effect on size estimation ‘, ce pe nding 
on configurational aspects of the stimuli, and on 


relatively few exposures to the stimuli 


REFERENCES 


Brauner, J.S.,& Goooman,G.C. Value and 
need as organizing factors in perception. 
J. abnorm, soc. Psychol., 1947, 42, 33-44 
Bruner, J. S., & Postman, L. Symboli 
value as an organizing factor in percey 
tion. /. soc. Psychol., 1948, 27, 203-208 
Brauner, J. S., & Ropricurs, |. S. Some 
determinants of apparent size J. ab- 
Psychol., 1953, 48, 17-24 


Personal and env 


norm, s06 
Sincer, J. L. 


determinants of 


ronmental 
perception in a s8ize 
J exp Psy hol ‘ 


constancy experiment 


1952, 43, 420-427 


(Received October 22, 1956) 





Journal of Experimental Psychology 
Vol. 54, 47 5, 1957 . 


SEMANTIC GENERALIZATION 
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One approach to the study of 
meaning has come under the rubric of 
semantic generalization. Such gen- 
eralization is usually indicated by 
positive transfer presumed to result 
from a similarity defined in terms of 
an association between either two 
signs, two objects, or a sign and an 
object. Traditionally, the strict me- 
diated generalization study, following 
an A—B, B—C, A—C, paradigm has 
been excluded as not being “‘semantic 
in character” (2). This leaves situa- 
tions mainly of the sort that may be 
represented by the paradigms, A—B, 
A—C, B—C, and A—C, B—C, A—B, 
facilitation being expected in the 
third task as the result of the other 
relationships specified. 

Foley and Cofer (2) demonstrated 
semantic generalization in a situation 
where the recall of a test list of words 
was facilitated when it followed train- 
ing on a serial list made up of syno- 
nyms and homonyms of the test list. 
Osgood (4), however, points out that 
because of the constant relation be- 
tween the training and test lists, the 
Ss may have reconstructed, rather 
than recalled, the test list once the 
relationship was recognized. 

Peters (5) performed several ex- 
periments following the A—B, C—B, 
A—C paradigm without obtaining 
particularly impressive results, except 
in one experiment where meaningful 
words that bore a part-whole relation- 
ship to each other were used. 


It ap- 
peared, however, that Peters’ learning 
criterion was not sufficiently high if a 
single anticipation trial rather than 


learning were to be used as the 
measure of facilitation. 

Nevertheless, the Peters studies 
suggest that semantic generalization 
could be more adequately studied 
if the essential relationships were 
brought under experimental control 
by using nonsense syllables. The 
semantic relationships that are pre- 
sumed important can then be created 
in the experimental situation. 

An experiment by Bugelski and 
Scharlock (1) on mediated general- 
ization provided a convenient and 
efficient design for the type of study 
proposed. However, rather than use 
only an arrangement of syllables that 
would presumably produce interfer- 
ence with which to contrast the 
facilitation experience, a third subset 
of syllables was added for comparison 
with the effects of both interference 
and facilitation subsets. 

Thus, the following experiment was 
designed to (a) demonstrate semantic 
generalization when the “semantic 
relationships” were under _ experi- 
mental control, and (b) to evaluate 
semantic generalization through both 
transfer and interference effects. 


Meruop 


Apparatus and procedure.—Each S learned 
three lists of 12 pairs of nonsense syllables by the 
anticipation method. These lists were learned 
in three sessions separated by 48-hr. intervals. 
Each list was made up of three subsets of 
syllables, two experimental and one control, so 
that each S provided both experimental and 
control data. The E, subset was four syllable 
pairs which followed the paradigm, A—B for 
Session 1, A—C for Session 2, and B—C for 
Session 3. The second experimental subset of 
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syllables (Ez) followed the same daily sequence 
as did the E, syllables except in Session 3 when 
the syllables of the FE, subset were rearranged so 
that the two syllables making up each pair had 
not been common responses to a single stimulus 
as was the case for the F, syllables in this session. 
The control subset (C) followed the paradigm 
A—B, D—C, B—C, and as can be seen, differed 
from the experimental groups in Session 2 in that 
the C syllables were not associated with A 
syllables but with a new set of stimuli. The 
three subsets of syllables were combined in an un 
predictable order in each list. 

In order to balance the difficulty level of 
various syllables in the various subsets, Ss were 
divided into three groups and lists were prepared 
such that each set of syllable pairs was used in 
each of the experimental as well as in the control 
subset. ‘Thus there were nine lists of 12 syllable 
pairs, one for each group for each of three ses- 
sions. The syllables were taken from Glaze’s 
lists (3) with association values between 80% 
and 87% 

The instructions ones for 
paired-associate learning with the exception that 
Ss were requested to pronounce the syllables 
rather than spell them. The syllables were 
presented with a Gerbrands M-1 memory drum 
and S was separated from EF by a hardboard 
A 3-sec. interval separated the presenta- 
tion of the stimulus and the stimulus-response 
pair, and another 3 sec. elapsed before presenta- 
tion of the next stimulus. A 6-sec. interval 
separated repetitions of the list. A 1-min. rest 
was introduced after each nine repetitions of the 
list unless S was in the process of meeting the 
learning criterion. All lists were learned to a 
criterion of two successive correct repetitions 
followed by three more repetitions of the list. 

Subjects. —The Ss were 21 male students 
drawn from psychology classes. None had had 
previous experience with nonsense syllables. 


were the usual 


screen. 


RESULTS AND Discussion 


The mean number of errors to 
criterion for each group, for each 
subset, is presented in Table 1. The 
subset E,, that was expected to indi- 
cate semantic generalization through 
fewer errors in Session 3 than the 
other subsets, provided results in the 
predicted direction. The interference 
subset (E2), however, did not produce 
a mean that differed from that ob- 
tained for the control (C) subset. 
The analysis of variance summarized 


TABLE 1 


Mean Errors To Crirerion FOR THE 
Dirrerent Groups AND 
SyLLaABLe Suspsets 


Syllable Subsets 


Groups 


Total 


in Table 2 indicates a significant 
Treatment effect (P < 01). The 
group differences, S differences, and 
interaction did not produce signifi- 
cant F ratios. Thus, there are no 
limitations placed on the interpreta- 
tion of the obtained difference from 
the statistical standpoint. However, 
if one tries to interpret these data 
within the usual associationistic inter- 
pretation of mediation phenomena, 
there are theoretical problems. The 
reasoning necessary to the prediction 
of facilitation for E,, also predicts 
interference for Ey, unless some auxili- 
ary process is proposed, or unless one 
can find some reason for rejecting the 
present control group as inadequate. 


There are several ways in which this 
latter possibility may be evaluated. 
Even though the analysis of variance of 
errors in Session 3 showed no group dif- 
ferences or significant interaction with 


TABLE 2 
Awnacysis or VARIANCE or Errors 
TO CRITERION IN Session 3 


Source df MS 
216.0 
431.6 
301.0 
74.2 


Between groups 2 
Error 1k 
Between treatments 
TXG 

Error 


Total 


*P = MO, 
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other variables, the groups were com- 
pared in Session 1. Mean trials to cri 
terion for Session 1 were 20.0, 20.4, and 
21.2 with standard errors of 1.86, 3.47, 
and 1.67, respectively. Thus, there is 
no evidence of differential learning ability 
among the three groups when the learn 
ing conditions were identical. In Ses 
sion 2, the control condition can be differ 
entiated from the two experimental con- 
ditions, inasmuch as it involved associ 
ating the C syllables with a new set of 
stimulus syllables rather than with the 
previously used A syllables. The mean 
errors to criterion per syllable pair were 
8.95 and 9.47 for the experimental condi 
tions and 7.77 for the control conditions 
The appropriate analysis of variance and 
f tests indicate that there are significantly 
fewer errors on the control syllables than 
on the experimental syllables. This 


difference is not surprising inasmuch as 
some interference would be expected in 
the experimental subsets in Session 2. 
Whether this difference has any implica 
tions with respect to the results obtained 


in Session 3, however, is another ques 


tion. Considering the 
tude of 


relative magni 


these means it does not seem 
plausible to suggest that practice effects, 
warm-up, or familiarization, were not 
equalized, and thus this difference, in 
spite of its significance, does not provide 
a suitable explanation for the lack of 
difference between the control and inter 
ference subsets in Session 3. 

The positive results of this experiment 
when compared with the essentially 
negative results obtained by Peters are 
probably not due so much to the different 
paradigms involved, as they are to the 
higher learning criterion, the greater 
efficiency of the experimental design, 
and the fact that learning rather than a 
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single anticipation was used as a criter- 
ion. Further research will be necessary 
to establish whether interference can be 
produced by the semantic generalization 
paradigm. Interrogation of Ss gave no 
indication of awareness of the possibility 
for mediated transfer in Session 3. 


SUMMARY 
Previous studies of semantic generalization 


have used provided in the un 
controlled experience of S. 


associations 
The present experi 
ment used one possible paradigm of semantic 
generalization, and with syllables, 
provided the appropriate associations through 
training. With each S serving as his own 
control, that arrangement of syllables that was 
expected to facilitate performance did so to an 
extent differing significantly from a presumably 


nonsense 


neutral arrangement, and an arrangement ex 
pected to produce interference. ‘The latter two 
training procedures, however, did not produce 
differential learning on the critical task 
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Hull (9) postulated that behavior 
strength is a multiplicative function 
of a learning factor, habit 
(sl[g), and a motivational factor, 
generalized drive (D). s// x is defined 
by changes in behavior due to prac- 
tice; 


strength 


D is defined jointly by greater 
energization of 
such 


behavior following 


antecedent conditions as de- 
privation of food and noxious stimula- 
Hull then hypothesized that 


the total effective drive strength (D) 


tion. 


operative at a given moment consists 


of the (D) 
plus the irrelevant drive strength (D) 


relevant drive strength 
derived from all needs coexisting in 
D is contributed by 


which is 


the organism. 
the need 
criterion behavior, 
tributed by 
torically 
behavior and 


reduced by the 
whereas D is con- 
needs which are his- 
unrelated to the criterion 


whose existence is de- 
fined by independent operations. 
Hull 
the drive summation hypothesis from 
studies by Perin (13), Williams (20), 
Zener and McCurdy (22), Finch (8), 
and Skinner (17). Following these 
early which dealt 
with the problem, 


adduces evidence to support 


studies, indirectly 
experiments were 
specifically designed to test this hy- 
pothesis. ‘These studie s divide them- 
selves into those conce The fd ruth, con- 


ae 
summatory behavior and those de valing 


with preparatory behavior. 


! This article is a part of a Ph.D. dissertation 
submitted to the Faculty of the 
Louisiana State University. The 
debted to Dr. Clyde E. Noble 
direction the research was performed 

?Now at the State Colony and 
School, Pineville, Louisiana. 


Graduate 
author is in 
under whose 


Training 


», independent 


Siegel and Siegel (15), Amsel and Maltzman 
(3 Siegel and Brantley (16), and Wohl (21) 
report evidence for 
In each of these stu tionality 
faradic 
relevant need 
lefined the relev 
sumption or water consumpt« 
behavior Ellis, Siegel, 


ant need 


positive drive summation 


lies em , induced by 
stimulation, was en | loyed a an 
and either food or water ne pat 
ant need Ly vd ( 
mwa Ae crite 


and Wohl (7 », USING & 


hunger f 


tion 


less intense reley und nega 
tive evidence for the hypothesis 

Several tudies have 
effects of D upon preparatory behavior I’x 
periments by Koch and Daniel (11), Siegel (14), 
and Strange (18) yield data which do not 
support Hull’s hypothesis. Stud Kendler 
(10) and Amsel 1) ‘wative and 
positive evidence, depending upon tl 
utilized and the intensity of the 
Webb (19) and Brandauer (5 
ments which yield 
both drive summation and 

Miller (12) has rey 


' 
vest that an 


been concerned with the 


ics b y 
report both ne 
i¢ need state 
need state 
port 
hiindings only, 
ibstitution 


rted tw tud 


experi 
positive 


relevant need in 


need ha 


The 


ment is 


been eliminated 


D used in the 
called emotionality, 


present experi 
viewed a 
a perseverative response following 
motivational 
D is generated by food 


The criterion behavior se 


pain and _ possessing 
properties : 
privation. 
lected is start-box latency and running 
AC h 
tested at 


strong levels y ielding 


time in a linear, alley maze. E 


drive condition has been 
zero, weak, and 
conditions for 


The 


summation and/or 


unique drive nine 


groups occasion 


for drive substitu- 
tion has béén provided under condi- 
tions which appear to constitute a 
fair test of the hypothesis. The con 
ditions have Siegel 


follows: ‘‘(a) the 


been outlined by 
Brantley as 


level is not 


and 


drive already maximal, 


(b) the appropriate stimulus occasion 
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for the elicitation of the response is 
presented, and (c) stimuli eliciting 
competing responses are absent”’ (16, 


p. 304). 


MetTuop 


Subjects.—The Ss were 126 male and female 
albino rats of the Harlan strain. Their ages 
ranged from 80 to 90 days at the beginning of the 
experiment, and their weights from 122 to 258 
gm., with a mean of 187.3 gm. The Ss were 
housed in groups of four in cages of approxi- 
mately 20 K 10 XK 10 in. and were maintained 
on Purina Dog Chow Checkers and water ad lib. 

Apparatus.—Two linear mazes and a shock 
box constituted the apparatus. The maze 
utilized for pretraining had a start box and a goal 
box of 18 K 11 XK 10 in. inside and a 30-in. 
straight alley, the inside dimensions of which 
were 4X 4 in. The start box was painted flat 
white; the alley and the goal box, flat black. 
Both the start box and the goal box had guillo- 
tine type doors which, when opened, activated 
microswitches controlling two electric clocks. 
These clocks measured start-box latency and 
running time to the nearest .Ol sec. The train- 
ing-test maze was identical to the pretraining 
maze except that the alley was 17 ft. 6 in. 

The shock apparatus consisted of a 20 XK 15 
X 14 in. compartment, the walls of which were 
painted with black and white diagonal stripes. 
The top was plate glass. The grid floor was 
made of brass rods 7% in. in diameter and spaced 
approximately 4 in. apart. A shock current was 
provided by a constant D.C. source with a rated 
output of 805 v. A l-megohm resistor was 
placed in series in the circuit in the instance of 
“strong” shock to control current intensity and 
to minimize changes in S’s resistance. For the 
“weak” shock a 2 megohm resistor was used. 
A Hunter-Brown electronic timer set at .5 sec. 
controlled the shock duration. 

Procedure.—The experiment was conducted 
in two sessions, each lasting 44 days and consist- 
ing of pretraining, training, and a test. ‘The 
pretraining regimen was designed to establish 
specific eating habits, adapt Ss to handling, and 
familiarize them with the maze environment. 
For the first 14 days Ss were fed wet Purina 
mash and handled during a I-hr. period each 
day. Water was available on the handling 
tables as well ‘as in the home sage. Following 
this phase of pretraining, they were given one 
trial daily for eight days in the short maze with 
wet mash available for 45 sec. in the goal box. 
During this phase the 23-hr. food privation 
regimen was maintained. 

At the termination of this 22-day period the 


training ptoper was begun. Training was 
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identical to the latter phase of pretraining except 
that the long maze was used and the training 
was continued for 21 days. Two measures were 
recorded for each training trial: latency and 
running time. Latency was defined as the time 
interval commencing 10 sec. after S was placed 
in the start box and ending when the start box 
closed, i.e., when S was in the alley and just 
clear of the door. The Ss were placed in the 
start box facing opposite the door. Running 
time was defined as the interval between the 
closing of the start-box door and the closing of 
the goal-box door after S had entered. 

The criterion test was conducted on the day 
immediately following the last training day. 
At this time the 126 Ss were equally and ran- 
domly assigned to the nine cells of a 3 XK 3 
factorial design. No sex bias resulted from this 
procedure. One dimension of this design con- 
sisted of 0, 12, and 23 hr. food privation, D. 
The other represented zero, weak, and strong 
shock, D. ‘Thus the nine groups will henceforth 
be designated O-n (none), O-w (weak), O-s 
(strong), 12-n, 12-w, 12-s, 23-n, 23-w, and 23-s 
where the numerals refer to the intensity of D 
and the letters to D. 

Groups 23-n, 23-w, and 23-s were tested under 
the hunger need that prevailed during training 
Groups 12-n, 12-w, and 12-s were allowed to eat 
for | hr. exactly 12 hr. prior to testing. An 
intensive effort was made to insure that groups 
O-n, O-w, and O-s were satiated at the time of 
testing. ‘This entailed allowing Ss access to wet 
mash for 24 hr. or until they had stopped eating 
for 5 min., 
mediately before testing. 


whichever time was longer, im 

Qualitatively, all Ss 
in these groups appeared lethargic and fairly 
quiescent. 

The treatments designed to induce emo- 
tionality were intermittent faradic stimulation 
delivered immediately prior to the criterion maze 
run. Groups receiving no shock were simply 
exposed to the shock box for 20-sec. Those 
receiving weak shock were given 10 shocks of .35 
mamp. during a 20-sec. interval and Ss getting 
strong shock, 10 shocks at .67 mamp. during the 
20 sec. 

Several precautions were taken to prevent 
generalization from stimuli associated with the 
shock situation to stimuli in the test environ 
ment. Some of these were: (a) the shock 
procedure was carried out by an E who had no 
other contact with the Ss, (b) the E who ad 
ministered the shock wore black rubber gloves, 
and (c) Ss were shocked in an environment other 
than the test situation. The exact procedure 
consisted of the shock-E taking S from his cage 
and placing it in the shock box. Then it was 
handed to the test-E and immediately placed in 
the start box. Four to 8 sec. elapsed between 
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TABLE 1 


Latency on Test TRIAt 


D 


Weak Shock (w) 


No Shock (n) Strong Shock (s) 


Food 
Deprivation 


0 hr. 
12 hr. 
23 hr. 


the end of the last shock impulse and the placing 
of S in the start box. 


REsULTs 


Table | presents median and mean 
latency for each of the nine groups on 
the test trial. Though latency raw 
scores are slightly skewed, the assump- 
tions required for an application of the 
analysis of variance are not seriously 
violated. Consequently, this statistic 
was used. The results of this analysis 
are presented in Table 2. It is 
clearly apparent that the intensity of 
hunger is inversely related to latency 
(F = 3.49, P = 05). Emotionality 
failed to influence latency  signifi- 
cantly, although there is a slight trend 
in the expected direction for the 0- 
and 23-hr. groups. There was also no 
significant interaction. <A further test 
of the effects of emotionality, which is 
perhaps even more critical, compared 
the test performance of the 23-n, 


TABLE 2 
ANALYSIS OF VARIANCE 


or Latency Scores 


Source 


dD 

D 
DD 
Within groups 
Total 


*P = 05, 


2.05 


SD | Mdn. 


1.13 
0.92 


23-w, and 23-s groups with their 
performance on the last training day. 
Groups 23-n and 23-w showed mean 
increases in latency on the test day 
of .68 sec. and .40 sec., respectively, 
whereas Group 23-s showed a mean 
decrease of 1.06 but ¢ tests 
indicated that these differences could 
be attributed to chance. A statistical 
comparison of the scores of 
Groups 23-n, 23-w, and 23-s apart 
from the other groups is logically 
contraindicated by the yesults of the 
previous analysis of variance 

Table 3 gives mean and median 
running time for each of the treatment 
groups on the test trial. The raw test 
scores were nonsymetrically distrib- 
uted within cells and the between-cells 
variance was heterogeneous In order 
to use the analysis of variance these 
data were converted to sten 
(6), which yielded normal distribu 
tions and homogeneity of 
between cells. Table 4 summarizes 
the analysis of variance. The F for 
D is significant beyond the .01 level; 
the F for D is less than one. 

The interaction variance yields an 
F which is just short of the 2.45 
needed for the .05 level. Reference 
to Table 3 shows this interaction 
derives mainly from the differential 
effects of emotionality upon the 23-hr. 
groups when compared to the other 
groups. It may be seen that running 
time is directly related to the intensity 


sec., 


test 


scores 


Variance 
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TABLE 3 


Runninc Time on Test Triar 


D 


Food 
Deprivation) 


No Shock (n) 


Mean 
102.5 
39.9 
13.7 


of emotionality for the 23-hr. Ss, but 
tends to be inversely related for the 
Q- and 12-hr. Ss. In this instance, the 
near significant interaction variance 
offers some justification for an inde- 
pendent analysis of the differences 
between the 23-hr. Ss alone. Such an 
analysis appears even more appro- 
priate if it is remembered that these Ss 
were tested under conditions which 
most nearly simulated training. 
Therefore a simple analysis of vari- 
ance was performed on the trans- 
formed data (sten scores) of groups 
23-n, 23-w, and 23-s. The F of 2.34 
with 2 and 39 df for between-groups 
falls far short of significance. The 
test performance of these groups was 
slightly poorer than that on the last 
training day. ‘The median (raw 
score) increased from 6.4 to 9.9 sec. 
this 
difference was statistically significant. 


A simple sign test indicated 


TABLE 4 


ANALYSIS OF VARIANCE OF 
Runninc Time Scores 


Source MS P 


100.0 | 38.46** 
0.5 
6.0 2.30 
Within groups 2.6 
Total 


-P = Ol, 


Weak Shock (w) 


Madn. 


| 


| 
| SD 


a | 
34.87 
| 10.53 | 


Discussion 


Perhaps our most striking result is the 
effect D reduction had 
time. 


upon running 
The reduction of D to zero roughly 
doubles running time, and, significantly, 
four Ss in these groups did not reach the 
goal box during the allotted 6 min. (A 
score of 6 min. was assigned to these S's 
for statistical purposes.) The effect pro 
duced by the reduction of D to 12 hr. is 
less pronounced. However, the mean 
running time for these groups combined 
is approximately equal to that at the 
beginning of training. Though there is 
a significant increase for the 23-hr. Ss, 
this change is not appreciable in view of 
day-to-day variability. Whether the 
large decrement for the O- and 12-hr. 
groups is due to a loss of the cue function 
or the energy function of D cannot be 
determined at this time. 

To anticipate an argument which may 
be advanced, the change in procedure on 
the test day could have had a general 
disruptive effect upon performance. 
Such a general hypothesis could account 
for the over-all decrement in both run- 
ning time and latency. Nevertheless, it 
would seem that some state of affairs 
must carry over to the maze situation in 
order to affect performance. Our only 
answer to this criticism would be that 
the test was specifically designed to 
minimize transfer of directive factors 
and to maximize energy transfer. Fur- 
thermore, a technique similar to the pres- 
ent one has been used successfully in 
several past studies (3, 12, 15, 16). 

In general, the trend in the latency 
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scores on the test is similar to that of 


running time. However, the observed 


increase is not statistically significant. 
The behavior of Group 0-n proves inter- 


esting in view of Hull’s original formula- 
tion of the generalized drive notion and 
the related Koch and Daniel study. It 
may be recalled that Hull extrapolated 
a curve depicting Skinner-box behavior 
to a point at which Ss would be perform 
ing under zero relevant drive, and found 
that behavior strength was appreciably 
above zero. He attributed this residual 
behavior strength to the operation of D. 
Later, Koch and Daniel (11) tested the 
validity of the extrapolation by placing 
satiated animals in a similar situation, 
They found that when D is reduced to 
zero, six is likewise reduced almost to 
zero. This finding is not clearly sup 
ported by the present data. Though the 
0-n Ss were very slow, 13 out of the 14 in 
this group reached the goal box within 6 
min. Five of these Ss were observed to 
nibble food. Qualitatively, however, the 
typical behavior of S in the maze follow- 
ing exposure to the shock apparatus did 
not appear to be goal directed. Specifi- 
cally, they had a mincing gait in the alley, 
were inclined to urinate, and appeared 
acutely disturbed. For these reasons we 
tend to believe simple exposure to the 
shock apparatus induced mild emotional 
ity, which, according to the generalized 
drive hypothesis, should have energized 
goal-directed behavior in this situation. 
It is felt that this particular result offers 
no clear evidence for or against the hy- 
pothesis. 

The present data, viewed in their en 
tirety, cannot be offered in support of 
Hull’s generalized drive hypothesis. In 
those instances where D is low, D should 
facilitate behavior more than when D is 
high. The near-significant interaction 
obtained from the analysis of the running 
time data suggests a converse effect. It 
may be noted, by referring to Table 1, 
that median running time for the 23-hr. 
Ss is directly related to the intensity of 
D whereas the relationship for the 0- and 
12-hr. Ss tends to be inverse. No ap- 
preciable interaction is evident in the 
latency data. 
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Several ad hoc explanations could be 
offered for the present negative findings. 
Perhaps the utilization of emotionality 
as a drive would bear careful scrutiny in 
view of the fact that many theorists 
have considered it a disruptive influence 
rather than a motivational state. Sec- 
ondly, the possibility of competing re 
sponses cannot be ignored. Amsel (2) and 
Amsel and Cole (4) have clearly shown 
that, if competing responses are condi 
tioned to cues associated with D, the 
criterion behavior will undergo a decre 
ment. 

Finally, the criterion behavior 
prove central to the problem. Those 
studies concerned with consummatory 
behavior have generally reported positive 
findings. On the other hand, positive 
evidence for a facilitative effect of D 
upon preparatory behavior is rare 
Kendler (10) alone found positive evi 
dence for a drive summation in a Skinner 
box situation, and this finding was con 
tingent upon the intensity of D. Webb 
(19) reported positive results when the 
criterion 


may 


behavior is door opening be 
havior, although he used two need states 
which may be physiologically 
founded. Amsel (1) failed to observe 
an increase in maze performance when a 
hunger need was added to a motivational 
complex in which the need state was emo 
tionality. Behavior facilitated 
when the dominant state 
changed to anxiety. Only Miller 
reported positive results from a 
which resembles the present one. 

In conclusion, the present data cannot 
be offered in support of Hull’s concept of 
generalized. drive, i.e., substitution or 
summation. On the other hand, to pro- 
pose that they constitute evidence against 
the hypothesis implies confidence in the 
fairness of the test, which we do not feel 
is entirely justified. 


con 


was 
need was 


(12) 
test 


SUMMARY 


{In order to test Hull’s generalized drive 
hypothesis, 126 rats were trained to run a linear, 
alley maze while under a 23-hr. hunger need. 
Following 21 daily training trials, Ss were 
assigned to nine treatment groups consisting of 
the ‘possible combinations of 0, 12, and 23 hr 
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food privation, as a relevant need, with zero, 
weak, and strong faradic stimulation as an 
irrelevant need. The “test” consisted of a trial 
The state induced 
by the faradic stimulation was called emotion- 
ality and 
properties, 


similar to a training trial. 


viewed a8 possessing motivational 
Both start-box latency and maze- 
running time were utilized as criterion behaviors. 

The findings were mainly negative. The 
criterion behavior was not significantly in- 
fluenced by the introduction of an irrelevant need 
into the motivational complex. As expected, 
changes in the relevant need state did signifi- 
cantly affect latency and running time. A near 
significant relevant need times irrelevant need 
interaction was concluded 
that the present data provided no definite evi 
dence either for or against the theory. 


observed, It was 
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EMOTIONALITY AND THE YERKES-DODSON LAW 
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The Yerkes-Dodson Law (17) which 
states that the optimum motivation 
for a learning task decreases with 
increasing difficulty has been shown 
to hold for several species. See 
Young (18). It is the purpose of the 
present experiment to extend the 
range to include the rat. However, 
individual differences in drive strength 
are clearly important here, as may be 
seen from the current work showing 
the differential effect of anxiety on 
human learning (16). Recent evi- 


dence (8) supporting the identification 
of neuroticism as an autonomic drive 
(7) suggests that emotionality, as 
defined by defecation scores in Hall’s 
open-field test (10), may be a relevant 
variable, since the lability of the 
autonomic nervous system probably 


underlies both neuroticism in humans 
and emotionality in rats (6, 10). 
Emotional rats will therefore be ex- 
pected to show greater drive than non- 
emotionals in a situation in which the 
motivation used (air deprivation) is 
intense and of a kind likely to give rise 
to fear responses, and consequently to 
learn faster when the task is easy, but 
more slowly when it is hard. That is 
to say, the optimum drive level in the 
Yerkes-Dodson situation should be 
lower for the emotionals than for the 
nonemotionals. We may therefore 
predict an interaction between emo- 
tionality and the two variables of 
motivation and difficulty level which 
should themselves interact in the 


manner suggested by the Yerkes- 
Dodson Law. 


MetTHop 


Apparatus.—The Y-shaped discrimination 
apparatus designed by Jonckheere (12) for sur- 


face swimming was adapted for underwater use’ 
This is a conventional Y unit with metal walls 
2 ft. high and alleys 4 in. wide. The stem of the 
Y is 7 in. long and the arms make an angle of 
52° to the center line. Vertical partitions across 
each arm, 7 in. beyond the bifurcation, extend 
from top to bottom of the alley and contain the 
discrimination panels. The Ss were forced to 
swim through the unit completely underwater 
by roofing in the alleys before the partitions with 
stainless steel hardware cloth just below the 
water level (9 in.). The discrimination panels 
were hung vertically and hinged below the 
surface of the water. They were made of 
frosted Perspex and each presented an illumi- 
nated area, 34 in. square, at right angles to the 
line of sight from the bifurcation. Either could 
be locked into place, or left free, to be opened 
underwater by S to allow escape to the part of 
the alleys beyond the partitions which were not 
roofed in below the surface. From here, hard- 
ware cloth ramps gave access to a platform above 
water level. 

The illumination was provided by two lampsin 
parallel in a 24-v. circuit. One lamp was located 
above water level behind each discrimination 
panel with its filament parallel to the panel. A 
fixed portion of a wire resistor could be switched 
into the circuit of either lamp in order to dim the 
intensity of the light shining through the 
adjacent panel, the other lamp being undimmed. 
The S was thus presented with a choice between 
a well lit and a less well lit avenue of escape from 
underwater. ‘Three amounts of resistance were 
used; the greater the resistance the dimmer the 
light, yielding a greater difference between the 
illumination of the alleys, and an easier dis- 
crimination. Measurements with a light meter 
(Avo No. 2) indicated that this difference in 
illumination could be represented, in ascending 
order of difficulty, by the ratios 1: 300, 1:60, and 
1:15. These constitute the three levels of 
difficulty and are designated easy, moderate, and 
difficult. 

A submersible cage, measuring 9 KX 6 X 44 in. 
wide, also made of stainless steel hardware cloth, 
was fitted with a sliding guillotine door which 
could be released from above the surface, thus 
allowing escape through the discrimination ap- 
paratus. Following a suggestion by Mason and 
Stone (14), the intensity of the air deprivation 
used as motivation was manipulated by detain- 
ing the Ss for different lengths of time submerged 
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10 15 20 
Secs. Delay 
MOTIVATION 


Fic. 1. The relationship between speed of 
swimming a 4-ft. straightaway underwater and 
intensity of imposed motivation (air deprivation) 
measured by the number of seconds delay under- 
water before release. Each point represents the 
mean time for 20 Ss; the data weve collected on 


successive day Ss 


underwater before releasing them to make the 
discrimination on which the promptness of their 
escape depended 
female 
Fig. 1. There is clearly no advantage in using 
delays longer than about 10 sec 
ance is not thereby improved 


Preliminary work, using 20 
rats as Ss, gave the results shown in 
, since perform 
Indeed, delays 
result in decreased swimming 
speed, presumably because of the effects of 
anoxia. Delays were therefore kept short, and 
delays of 0, 2, 4, and & sec 


longer than 20 sec. 


before release were 


selected, which represent approximately equal 


(see Fig 1). 
These constitute the four levels of motivation 
Five replications of the 3 & 2 &K 4 


design (24 


increments in swimming speed 


Subjects 
factorial used 
T hese 


120 Ss were male albino rats which formed part 


treatments) were 


that is, a total of 120 Ss was required. 


of the second and third generations in a selective 
breeding study of emotionality being conducted 
in this laboratory (3). When Ss averaged 105.5 
days of age (SE + .25), they 
modified and 


were given the 
open-field test 
Briefly , Ss 


per day for four suc 


standardized 
described in detail elsewhere (1) 
were exposed for 2 min 
cessive days in a circular arena 32} in. in 
diameter with white plywood walls 124 in. high. 
\ battery of loudspeakers and of photographic 
lamps above the arena provided sound (“white” 
noise) and light fields whose intensity at floor 
level in 78 db. (ref. .002 


The 


the arena averaged 
dynes/sq. cm.) and 165 ep., respectively 


‘ 
average number of fecal boluses deposited per 
day constitute the emotional reactivity score, 
and only Ss scoring 3.3 or more were assigned to 
+ 15), and 
only Ss scoring 1.3 or less were assigned to the 
These 


two groups constitute the two levels of treatment 


the emotional group (mean = 4.1, 
nonemotional group (mean = .8, +.17) 
difference 


between them is significant beyond the .1% 
level by t test 


in the emotionality variable; the 


The populations from which Ss were selected 
by virtue of their emotional elimination scores 
had been bred by brother X sister mating from 
a heterogeneous Wistar stock They had been 
reared standard conditions which are 
described fully elsewhere (3). In brief, these 
conditions featured controlled temperature and 
light/dark cycle n 


under 


standard living 


standard conditions of husbandry 


quarters, 
diet, routine 
and the 
Some of them had been 


care, number of animals per cage, ete 
minimum of handling 
interval 
field and in the 
underwater discrimination unit, which interval 


used for breeding purposes in the 


between testing in the open 
itself varied with Ss’ generation, but there is 
evidence (2) to suggest that neither the sexual 
experience, nor age differences of the 
encountered, are likely to affect Ss’ emotionality 
as measured, 


order 


The mean age when experimenta 
tion began of the 45 Ss belonging to the second 
322.0 days (+6.09), and the 
of the 75 Ss belonging to the third 
generation 213.4 days (+3.31) 


is highly 


generation was 
mean age 
This difference 
(P< OOl by t test); 
possible effect of this age 


significant 
accordingly, the 
difference 


upon the scores analyzed was in 


vestigated. ‘There was no significant difference 
by ¢ test between the over-all means of the two 
used (discussed 


groups for the learning score 


later), but, as might be expected, the older Ss 
(P < Ol, by t 
It need not, however, be anticipated that 
any systematic bias is thereby introduced into 


swam significantly more slowly 
test). 


the analy 518 of the speed score. since Ss had been 
As a check, 
a x’ test of the proportions of Ss of the two age 


assigned to treatments randomly. 


groups assigned to each treatment combination 
showed no significant difference from that exist 
ing in the total group (45:75). 

Procedure.—The random assignments of Ss 
to the 24 treatment combinations was made by 
forming the 60 emotional Ss into 12 groups of 
five each, equated as far as possible with respect 
to defecation scores, and the 60 nonemotional 
Ss into a further 12 groups of five each, similarly 
The groups thus formed were then 
within the 
dichotomy, to the various treatment combina 
f 24 (one 


successive days 


equated 


randomly assigned, emotionality 


tions. The Ss were tested in groups « 


complete replication) on 15 





EMOTIONALITY 
The first five days were dev 
training dur 
surface and underwater swimn 
ch time the appropriate 
detention periods for the 


different Ss 
established, then underwater 


ted to I relimina y 
ing which Ss were successiv« 

ng practice 1 
3-ft. straightaway, at wl 
were 
practice in the 
them with the 
opening vors formed by the 
During this time the 
f the ¥ uni 
blocked at the bifure: ! y a solid partition, 


discrimination unit to familiarize 
yperation ol 
discrimination panels 
doors were ul lit, and ) side was 
the other, in 
{ osition 
was thus 
use la 
f this training 
mmptly on 
release, and without 
lifficulty 
On the next 10 
total of 100 trial 
with a choice 
in illumination between 
llumination was present 
containing the water tank 
rding lamy at ! i rac 
self-draining metal 
n the intertrial 
maintained relat 
te the var 
in larger maller 
Prous brighter side of the 
rrect and kept open; an S 
the darker Je 


where panel was 
locked was thereby 


init was designated « 
choo ing 
forced to retri its way to 
the brighter de in or r to escape That is, the 
method was used. The bright side 
was selected since it w constant for each of the 
three levels of ditheul 
no tendency for Ss to prefer one side over the 
ther Thus, o h irst trial upon 
56.7% went to the 
$3.3% 


diffe 


correction 


Vy. and because the re was 


to the dimmer 
rence from chance 

The 
m trial 
10 Gellerman 


y varied fre 
selection of 
used in the 


same 


water temperature was 


noment when the d 
released until S’ 


he surface f the water beyond the 


luminate 1 panel thr 
1 to the 


/ ‘ 
iehined as ar er 


ugh which it had pass 1 was 


nearest I watch, 


of S’s head and 


AND THE YERKES-DODSON 


LAW 


The five replicati ns 
tested at intervals 
by a partial replicat 
stitutes for the 
experimentation 
excluded 
habit \ | ition Nabit 


hree 
th 


and the 
were because they 
as the choice on tw 


last five da 


purpose 
more during the 


side——left or right-—exclusively 


RESULTS 


The number of errorless trials out of 
the 100 trials given was counted for 
each S and the data subjected to a 


three-way analysis of variance after 


Pearson and Hartley’s test (15) had 
disclosed no significant inhomogeneity 
The 
Table | 
the difficulty 
effects were calculated by 
Difficulty * Motivation 


Variance 


results 
The / 


motivation 


of Variance are pre- 


sented in ratios for 
and main 
using the 
interaction 
the 
significance of the interaction between 
The nature of 

) 


this interaction is indicated in Fig. 2. 


estimate, in view of 


these two variables 
Table 2 shows the mean scores and 
SD’s for the various levels of the two 
treatments main 
effects. the efh- 
of the experimental manipulation 


significant 


These data indicate 


having 


cacy 


\ ariables 


of these two independent 


TABLE 1 
oF VARIANCE LeAR 
AND SPEED Scort 


Difficulty 00.2 

Motivation 144 
114.1 
oH1 
44.9 
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Fic. 2. A three-dimensional surface show 
ing the relationship between learning scores in a 
discrimination task and (a) the intensity of the 
imposed motivation (air deprivation) measured 
by the number of seconds’ delay underwater 
before release and (b) the level of difficulty of the 
task. ‘The lamina are spaced to represent the 
over-all mean score for the appropriate difficulty 
level (see ‘Table 2). Each point represents the 
mean score for 10 Ss. 


Thus, each increase in the difficulty of 
the discrimination yields a_ signifi- 
cantly lower learning score, reaching 
the .O1 level (by 1-tail ¢ test) in the 
case of the easy vs. moderate differ- 
ence, and the .0O1 level in the case of 


the moderate vs. difficultone. Increas- 
ing the time of air deprivation first 
increased the mean learning score 
significantly (O- vs. 2-sec. delay), then 
caused a significant decline from the 
peak value (2- vs. 8-sec. delay; 
P < .05 in both cases by ¢ test). 
This latter finding is comparable with 
that of many workers reporting a 
curvilinear relationship between moti- 
vation and performance when in- 
creasingly intense motivation is em- 
ployed (see also Fig. 1). 

It is clear from these results that the 
Yerkes-Dodson Law may be taken as 
confirmed. The optimum motivation 
for a discrimination task demon- 
strably decreases with increasing diffh- 
culty of the task. This effect, repre- 
sented by the first-order interaction 
between the difficulty and motiva- 
tional treatments, is significant at a 
satisfactory level, and part of the 
prediction is thus verified. It is 
equally clear that the prediction, as 
made above, relating to the effects of 
emotionai reactivity on this complex 
relationship is not fulfilled. The 
trend of the results is partly in the 
predicted direction. ‘Thus emotionals 
learn the easy discrimination faster 
than nonemotionals, and are about 
equal on the moderately difficult one, 
and similarly learn faster under low 


TABLE 2 


Breaxpown or LearRNinG Scores 


Motivation 
avel 
(Air 

Deprivation) 


Moderate 


Mean 


O sec. 
2 sec. 
4 sec. 
8 sec. 





All 


Difficulty of Discrimination 


Difficult 
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motivation, and are about equal at 
medium levels. But on the difficult 
discrimination or under the most 
intense motivation used they again 
show a slight superiority. Conse- 
quently, the emotional Ss show an 
over-all superiority in learning, which, 
however, does not reach significance, 
and this is reflected in the failure of 
the one second-order and the two 
remaining first-order interactions, all 
of which involve the emotionality 
variable, to reach significance. 

If it is conceded that the curvilinear 
relation involved makes such a pro- 
cedure permissible, the partial success 
of the prediction relating to drive 
level and emotionality can readily be 
demonstrated. An analysis of vari- 
ance based on the first two lowest 
levels of motivation only—that is, 
before the decline in learning asso- 
ciated with motivation greater than 
the optimum begins (see Fig. 2)— 
shows two significant effects. The 
first is the expected and highly 
significant one associated with level of 
difficulty, the second is the first-order 
interaction between Motivation and 
Emotionality (P < .05). This effect 
may be summarized by saying that 
the emotional Ss show superior learn- 
ing under the minimal motivation 
used, but that this advantage dis- 
appears with the increase of motiva- 
tion from O to 2 sec. delay before 
release. 

Another way to investigate the 
relation of emotionality to drive level 
is to consider the speed of the response 
made to the motivational stimulus, 
irrespective of whether or not it re- 
sulted in improved learning. In order 
to derive a speed score for each S, and 
one which reflects its speed of swim- 
ming independent of the degree of 
learning attained, the time scores for 
the first day upon which all 10 trials 
were recorded as correct were aver- 
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aged. In this way the individual Ss’ 
speeds may be compared at exactly 
comparable points on their respective 
learning curves. Since the distance 
swum in the apparatus is essentially 
the same for each S making a “cor- 
rect’”’ choice (21 in.), the time scores 
can be used directly for the purpose, 
as was done in preparing Fig. 1, thus 
avoiding the difficulty relating to 
averaging time over distance scores 
discussed by Crespi (5). For the 13 
Ss assigned to the difficult discrimina- 
tion who never learned it to perfection, 
the average of the last day’s trials was 
calculated, omitting times for those 
trials when incorrect choices were 
made. An analysis of variance per- 
formed on these data after no signifi- 
cant inhomogeneity of variance had 
been demonstrated gave results which 
are also shown in Table 1. From this 
analysis it will be seen that the 
superiority of the emotional Ss over 
the nonemotional Ss in speed of 
swimming reaches satisfactory sig- 
nificance. The mean time scores are 
1.80 and 1.96 sec., respectively. In- 
serting these values on a plot of 
swimming speed against motivational 
delays (like Fig. 1) shows that the 
difference of .16 sec. is equivalent to 
the effect on over-all speed scores of 
an increase in motivational delay of 
about 4 sec. Moreover, this effect is 
not significantly associated with any 
of the other variables in the analysis, 
from which it can be concluded that 
the emotionally reactive rats display a 
characteristically higher level of drive, 
which, within the limits of the present 
experiment, is not affected by the 
intensity of the motivation imposed. 
It may also be noted from this analysis 
that the absence of a significant effect 
for Difficulty shows that the speed 
score selected was in fact independent 
of the degree of learning, and that the 
significance of Motivation confirms the 
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efficacy of the experimental manip- 


ulation of the air deprivation vari- 
Thus, the mean time taken to 
swim 21 in. underwater 


able. 
decreased 
progressively from 1.98 sec. for O sec. 
delay to 1.97 for 2 sec., to 1.85 (4 sec.), 
and finally to 1.71 (8 sec.). The SD’s 
48, .33, .31, and .25 sec. re- 
spectively. Only the differences be- 
of the first three levels 
(0, 2, and 4 sec.) and the last one 
(8% sec.) reach significance (.05 level or 
beyond by I-tail t test). 


are 


tween each 


Discussion 


The confirmation of the Yerkes-Dod- 
son Law reported here gains in interest 
because it employed a different species 
from those hitherto used in demonstrat 
ing the principle, and because it occurred 
under rather different experimental con 
ditions. The motivation in particular 
was very different from that used in any 
of the other studies of the Yerkes-Dodson 
Law; the electric shock previously em 
ployed as an aversive stimulus was es 
chewed because of the difficulty of con 
trolling it, and because of its disruptive 
effect, especially when strong, upon early 
learning. Air deprivation as used shares 
with shock the quality of being a rather 
intense stimulus, and the advantage of 
being manipulable within the 
situation, It has, in addition, 
shown in practice to be easily varied in 
intensity. 

It is true that in both this and the 
original study a visual brightness discrim 


testing 
been 


ination was used as the method ot varying 
the degree of difficulty of the learning 
tasks. The resemblance hardly goes be- 
yond the semantic, however. The simi 
larity between the actual pieces of ap 
paratus used is slight. The Yerkes box 
has been described as “spectacularly in 
efficient” (13), whereas the underwater 
discrimination technique as used in this 
study gave rapid learning without undue 
preliminary training 

The results obtained from the original 
study and the present one are surprisingly 


similar. A comparison of Fig. 2 with 
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that given by Yerkes and Dodson (17, 
p. 479) shows that the optimum motiva 
tion is in general slightly lower in the 
scale used than that encountered by 
them, and that there is no level of difh- 
culty for which an optimum was never 
reached as was the case with their easiest 
discrimination. An increasing sharpness 
of the optimum peak with increasing dif 
ficulty is characteristic of both graphs. 
The dissimilarities mentioned can all be 
ascribed to differences in the levels of 
motivation selected. The relation of 
motivation to degree of learning is typi 
cally curvilinear, a decrease in learning 
following motivation more intense than 
the optimum, so it is not always easy to 
select levels of motivation to 
reveal the optimum learning, especially 
if it is required to include different optima 
associated with different levels of diffi 
culty, and more especially with a novel 
motivation like air deprivation 
which little is so far known. 

The confirmation of the Law is the 
more striking because of this difference 
In motivation. 


suitable 


about 


The needs arising from 
electric shock and from air deprivation 
are different physiologically, though they 
both give rise to drives mediated, in 
part, by fear. The further generalization 
of the Law must await experimentation 
using other situations and other drives, 
but the outlook is promising in view of 
the analogies in the human field already 
existing. 

The possible role of anoxia must not be 
overlooked in connection with the use of 
air deprivation, since it is known (11) 
that it can cause learning defects com 
parable to those caused by electrocon 
vulsive shock. It seems unlikely, how 
ever, that air deprivation of the duration 
used here can have led to anoxic damage 
to the nervous system sufficient to 
account for the curvilinear relation 
with learning. ‘To the arguments which 
lead Mason and Stone (14) to reject this 
explanation of their comparable findings, 
we may add the following germane con 
from this experiment. 
Firstly, if anoxia were important in this 
connection, it reasonable to 
anticipate that all the groups subjected to 


siderations 


would be 
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the greatest air deprivation would show 
a decrement in learning. A glance at 
Fig. 2 shows that this is not the case. 
Secondly, the analysis of the speed of 
swimming showed no significant differ 
ence between the difficulty levels 
Table 1). Now, the Ss assigned to the 
difficult discrimination had, at the time 
when the were selected, 
experienced much more air deprivation 
than had the other groups. They had 


(see 


speed scores 


made many more errors obliging them to 
hold their breath longer before escape, 


and they had also undergone many more 
trials. Neither of these circumstances, 
likely to increase any deficit ascribable 
to anoxia, depressed swimming speed, 
however. 

The drive characteristics of emotional- 
ity as defined by open-field defecation 
scores are only partly elucidated by this 
experiment. Emotionally 
have a significantly higher speed of swim 
ming 
ously 


reactive Ss 


that is, they respond more vigor 
to the same degree of imposed 
motivation. To this extent, they may 
be regarded either as having a higher 
drive level in general or as more sus- 
Further work 
to investigate the speed of swimming of 
these Ss, and employing a situation de- 
signed to demonstrate the Crespi effect 
(5) consequent upon changes in drive 


ceptible to drive arousal. 


level, confirms the superior speed of the 
emotionals and the presence of the “‘ela- 
tion” effect but indicates that they do 
not respond more to a sudden increase 
in drive than do the nonemotionals (4). 
It thus seems that drive level rather than 
drive arousal is involved. 

Yet this higher drive level did not in 
general effect learning in this experiment 
in the manner expected. It seems un 
fruitful to enter into detailed speculations 
about the reasons tor this seeming para 
dox—suffice it to say that it is probable 
that the situation used here is too com 
plex to expect that any definitive solution 
may be found in the data. 
Nevertheless the general finding relating 
to drive level does have some bearing on 
the postulated relation 
tionality in rats and 
humans. This cross-species identifica 


present 


between emo- 


neuroticism in 
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tion is not directly verifiable, but it can 
be cautiously said that the present re 
at the very least, do nothing to 
render it improbable. 


sults, 


SUMMARY 


designed to test the 
Yerkes Ly xdson 
investigate some of the drive characteristics of 
Ina 3&4 2 factorial 
design having five replications, 120 male albino 
Three levels of difficulty of 
brightness discrimination, four 
from different 
degrees of air deprivation, and two levels of 


The experiment was 
validity of the Law and to 
emotionality of rats 


rats were used as Ss. 
an underwater 
lev els of 


motivation deriving 


emotionality defined in terms of defecation 


Time 
and error scores from 100 trials were secured 


scores on the open field test were used 


Analysis of variance of the results shows that 
the Yerkes-Dodson Law, as demonstrated by an 
appropriate interaction between difthculty and 
motivation, is confirmed at an acceptable level 
of significance. The prediction relating to the 
only 


fulfilled in part; a significantly higher drive level 


effects of emotionality on motivation is 
of emotionals as shown by their swimming speed 
is shown 
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TRANSFER FROM VERBAL PRETRAINING 


TO MOTOR 


PERFORMANCE AS A FUNCTION OF RESPONSE 
SIMILARITY AND ANGLE OF MOVEMENT! 


DONALD R. HOFFELD 


University of Wisconsin 


The effects of verbal pretraining 
upon motor performance have been 
the subject of considerable research 
since 1950. In general the procedure 
in these experiments has been to give 
one or more groups some type of 
verbal pretraining, usually consisting 
of verbal responses to a physical 
stimulus, and then to compare the 
performance of these _pretrained 
groups to a control group which has 
had no pretraining. The results of 
studies in this area have been conflict- 
ing todate. Using simple discrimina- 
tion tasks, several studies (1, 9, 13, 14) 
have found evidence for significant 
positive transfer effects, while several 


studies using more complex tasks 


(12, 16) have found no positive 
transfer effects. This situation was 
clarified to some extent by Battig (3) 
who found that positive transfer 
decreased as task complexity in- 
creased. A number of variables in- 
fluencing the amount of transfer from 
verbal pretraining still remain un- 
clarified, however. 

The purpose of this experiment was 
to determine the effect of difficulty of 
a motor task and the relevancy of 
responses made during pretraining 
upon the amount of transfer. The 
mirror tracing task, first used by 


' This research was supported in part by a 
grant made by the National Science Foundation 
to Dr. W. J. Brogden, and is based on a thesis 
submitted in partial fulfillment of the require- 
ments for the degree of Master of Science at the 
University of Wisconsin. ‘The author wishes to 
express his appreciation to Dr. W. J. Brogden, 
‘under whose guidance this research was done, 
for his helpful suggestions. 


Starch (15), was selected because of 
the relatively simple response and the 
relative simplicity of verbalizing the 
motor response. Preliminary work 
indicated that angle of tracing might 
be a difficulty dimension, so the angle 
between the mirror and the tracing 
pattern was selected as an additional 
variable. The responses made during 
pretraining consisted of drawing lines 
of 0° and 90° in response to verbal 
stimuli. It is assumed that all Ss 
were at a plateau of performance for 
the O° angle before entering the ex- 
periment and that consequently the 
practice on the 90° angle is the only 
important response practice gained 
from the pretraining. In view of this, 
the relevancy of the response during 
pretraining varies directly with the 
angle, i.e., the larger the angle, the 
more relevant the response. The 
selection of these variables then per- 
mits the analysis of the relation be- 
tween task performance and angle of 
tracing, effect of relevancy of pre- 
training upon transfer, and the over- 
all transfer effect. 


Metuop 


Apparatus.—The mirror tracing apparatus 
consisted of a rectangular box 154 in. long, 10 in. 
high, and 15 in. deep. A window was cut in 
the rear of the box and a one-way mirror fixed in 
place so that E could observe the progress of S 
at all times. The interior of the box was 
illuminated by two &-w. florescent tubes. 
Shields were located so as to prevent the light 
from shining directly into S’s eyes and to prevent 
S from seeing the tracing pattern without aid of 
the mirror. ‘Tracing patterns were inserted into 
the box on masonite sheets by E. ‘The tracing 
patterns consisted of a series of patterns com- 
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posed of 0° and 15°, 30°, 45°, O°, or 75° 
The tracing pathway consisted of a 0° section 
4 in. in length followed by a 4-in. section at an 
15°, —30°, —45°, —60°, or —75°, 
followed by another 4-in. section of 15°, 30°, 45°, 
", or 75° 
peated eight times and the pathway concluded 
by a final §-in. 0° section. The pathway was 
approximately 4 in. wide. The patterns were 
reproduced on 84 X 13-in. sheets of paper by the 
Xerox The mirror tracing box was 
placed directly in front of S. The posture of S 
was fixed by use of an adjustable chin rest. A 
mechanical pencil was used for 


angles. 


angle of 


This sequence of sections was re 


process. 


tracing the 
pattern. ‘The arrangement was such that £ sat 
directly across from S and observed the course of 
a trial through the rear of the box. 

Subjects. The Ss 


psychology students who received class points 


were 150 undergraduate 


for their services. "They are assigned to groups 
on the basis of a pretest consisting of a random 
pattern containing one each of the five angles 
used in the experiment. The pattern was con- 
structed in the same manner as described above, 
but contained only one sample of each of the five 
In all, 159 Ss were pretested and 150 
selected for the experiment on the basis of time 
Approximately the first 125 Ss were 
assigned randomly to groups until one group had 


angles 
scores, 


been completed and the remainder of the assign 
ments made in a manner to match the groups’ 
Since 
this matching procedure was used, no attempt 


All Ss 


pretest time scores for means and SD's 


was made to equate the groups for sex. 
were right-handed 

Procedure.—TVhe experimental paradigm used 
is as follows: Phase 1, both experimental and 
control groups took the pretest; Phase 2, the 
experimental group received relevant verbal pre 
training while the control group received non 
both ex 
perimental and control groups received 10 test 
trials 


relevant verbal pretraining; Phase 3, 


Instructions for Phase 1 were given telling S 
that the task was to trace through the pattern 
as quickly as possible, but cautioning him to 
remain within the pathway at all times, and to 
return to the pathway if his pencil left it. After 
completion of Phase 1, Ss were assigned to 
groups as described above 

For Phase 2, all Ss in the control group were 
given nonrelevant verbal pretraining consisting 
The 


Ss in the experimental group learned word 


of learning a list of six nonsense syllables 
association pairs verbally. ‘The three associa 
tion pairs used were away~toward, right-right, 
and toward-away. These were typed on 
3 & 5-in. cards and held at eye level to S. The 
words were learned to a criterion of one perfect 


trial. The S was then seated so that he had 
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directly in front of him a blank sheet of paper, 
and, in response to the first word of each asso 
ciation pair, drew lines to symbolize the second 
word. A sequence of stimulus words was given 
until S had made eight consecutive correct re 
sponses. This completed training for the ex 
perimental group After learning the nonsense 
syllables, the control group was told that the list 
had nothing to do with the rest of the experiment 
Both 


experimental and control groups then received 


and was merely a rest interval activity 


a review of the original instructions, with both 
groups being reminded of the mirror reversal 
effect 

During Phase 3, the experimental (EF) and 
control groups (C) were divided into five sub 
groups eac h, with each of the five angles being 
traced by an experimental and a control sub 
group 
and the angle is measured between the frontal 
plane of the mirror and the orientation of the 
tracing pattern lines 


The five angles used are given above, 


Fach S then traced 10 samples of the same 
pattern. Time scores were recorded on a stop 
watch operated by L At the conclusion of each 
trial, E removed the masonite insert and sub 
stituted another sample of the same pattern 
The interval between trials was approximately 
10 sec. During the course of the test trials, 
all Ss were told on Trial 5 that they were im 
proving. No other knowledge of results was 
given 

Time scores were used as the dependent 
variable in part because Ss were matched on this 
variable. Error scores were not used because of 
the difficulty of objective scoring and the phe 
nomenon of blocking in 


Hoffeld (11). 


mirror tracing (see 


RESULTS 


Bartlett’s test for homogeneity of 


variance (7) was applied to the time 
The resulting chi-square 
490.64 (P < .0O1), which 
indicates significant heterogeneity of 
variances. An inspection of the data 
indicated that a logarithmic trans- 
formation could be reduce 
heterogeneity and help normalize the 
distribution. All 
transformed into log seconds. 


scores. 
value is 


used to 


time scores were 
The 
resulting chi-square value is 127.99 
(.05 > P > .025). An inspection of 
the variances revealed that this sig- 
nificant chi-square value was due to 


one variance (E-75° on Trial 8). If 
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TABLE 1 


ANALYSIS OF VARIANCE FOR Time Scores 
in Loc Seconps 


Source MS 
1.28816 
10.09985 
1.01151 
31111 
04064 
O2868 
02541 
00237 
OO28S 
07397 


Trials (T) 
Linear 
Quadratic 
Cubic 
(Juartic 
Quintic 
Residual 

Pretraining (P) 

‘Angle (A) 

\xP 
Experimental 

Linear 
Quadratic 
Cubic 
(Quartic 
Control 
Linear 73235 
Quadratic 44699 
Cubic 00510 
Quartic 14181 

AX T OLOIS 

F 00668 
OO74¢ 
O2885 


47693 
44682 
OL882 

03841 


*Pp OS 
Pp Oo 


this variance were not included in the 
test, the chi-square value would be 
108.08 (P > .05). In view of this 
fact and also because of the sensitivity 
of the Bartlett’s test, the .05 level of 
significance was used in all F tests. 
An analysis of variance was per- 
formed on the transformed time scores 
The 


indicates a 


and is summarized in Table 1. 
significant F for 
general improvement in the perform- 
of Ss. The any 
significant interactions involving trials 


trials 


ance absence of 
indicates that the form of the learning 
the groups. 
The data for all experimental and 
control were therefore 
bined into a single group curve which 


curves is same for all 


groups com- 
was fitted by orthogonal polynomial 
analysis (8). The breakdown of the 
orthogonal polynomial analysis is also 


given in Table 1. The over-all group 
curve shows significant linear, quad- 
ratic, and The 
significant F for the angle main effect 
indicates a difference in performance 
between the different The 
angle main effect and the angle by 
pretraining been 
broken down by poly- 
nomial analysis, with separate curves 
having been fitted to the experimental 
groups (10). The 


pirical curves are shown in Fig. 1. 


cubic components. 


angles. 
interaction have 
orthogonal 


and control em- 
The experimental group has signifi- 
cant linear and quadratic components, 
while the control group has significant 
linear, quadratic, and quartic com- 


ponents. ‘Tests of the differences be- 


tween means by the Duncan Range 
Test (6) indicate that the experi- 


group for 75° (E-75°) is 
significantly (P = .05) better in per- 
formance than KE-30°, C-60°, and 
E-15°, and that C-75° is significantly 
-30°. 


means 


mental 


better in performance than 
No 
significant. However, in view of the 
fact that the variable of 
evenly and quantitatively spaced, the 


other differences in are 


angle is 


orthogonal polynomial is the more 


powerful test, and the breakdown 


that 


means in the control and experimental 


indicates differences between 
groups exist but do not show up in the 


Duncan Range Tests. 


Fic. 1. 


control groups for time se 


Curves for the experimental and 


res in log seconds asa 


function of angle of movement. 
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Discussion 


Performance as a function of degree of 
relevancy of relevant S-R pretraining is 
shown to be affected by the presence of a 
significant Pretraining X Angle inter- 
action. For the angles (15°, 30°, and 
45°) for which the response made during 
pretraining was less relevant, there ap- 
pears to be an inhibitory effect. This is 
consistent with the results of Battig (2) 
and Battig, Hoffeld, Seidenstein, and 
Brogden (3), both of whom found non- 
significant inhibitory effects of relevant-S 
pretraining. For the angles (60° and 
75°) where the pretraining was more rele- 
vant, there seems to be a facilitory effect. 

The orthogonal polynomial analysis 
indicates that the differences between 
angles for the control group are signifi- 
cant, and inspection of the curve in Fig. 
1 shows that there was a general tendency 
for performance to be higher for the 
easier angles for the groups having rele- 
vant S-R_ pretraining. These results 
are also consistent with those of Battig 
(2), who found that positive transfer de- 
creased with increased task difficulty. 


The reason for the inhibitory effect of 
the verbal pretraining in some cases and 
the facilitory effect in other cases may be 
explained in terms of relevance of pre- 
training and resulting interfering tend- 


encies. It was found in a pilot study 
that it was not possible to have Ss verb- 
alize overtly at each choice point, since 
the time to make the verbal response 
tended to slow down performance too 
much. This is consistent with the re- 
sults of Rossman and Goss (14) who 
found a slight inhibitory effect when Ss 
were required to verbalize aloud. It is 
reasonable to assume that some time is 
required for § to make a verbal response 
covertly, even though this time is less 
than the time required for an overt re- 
sponse. Let this time for the verbal 
response be designated by K, assuming 
that it is constant for all groups. This 
response time will be added to all the 
experimental groups’ time scores. In 
addition, there is a quantity y, which is 
the decrease in time score as a result of 
the verbal response having been made 
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and serving as a stimulus for the proper 
motor response. This quantity y is 
then always negative with respect to the 
time scores. In addition, there is a 
quantity z, which is a function of the 
particular angle used in the tracing pat- 
tern, and a quantity ¢, which is a random 
error component. Thus Time Score = 
z—y+K+e. Itis apparent that the 
relevancy of the response’s decrease will 
effect a decrease in y. Thus when y is 
less than K, an inhibitory effect appears. 
The data are consistent with this explan- 
ation. 

The data on performance as a function 
of angle of movement are roughly in 
accord with those of Corrigan and Brog- 
den (5), who found a symmetrical sine 
function around 45°, with the poorest per- 
formance coming for the 45° angle. The 
task used by Corrigan and Brogden was 
a constant velocity tracking task with 
the angle of tracking as the independent 
variable. 

Using the same task as Corrigan and 
Brogden, Brogden (4) found that per- 
formance varied over trials for the diff- 
erent angles used, with no learning being 
evidenced for some angles. Absence of 
an angle by trials interaction in this pres- 
ent experiment indicates that this does 
not hold true for the mirror tracing task. 
Such differences are probably due to the 
fact that in this research Ss repeatedly 
had to change direction of movement, 
while Ss in Brogden’s study merely 
tracked a pip moving at a constant vel- 
ocity on a straight track. 


SUMMARY 


The purpose of this research was to determine 
whether the amount of transfer from verbal pre- 
training to a mirror tracing task varies differ- 
entially as a function of angle of movement. All 
groups were matched in time scores on the basis 
of a mirror tracing pretest. The dependent 
variable was time to trace a pattern made up of 
horizontal lines and lines forming angles with the 
horizontal lines. The experimental group had 
verbal pretraining of a relevant S-R nature, 
consisting of learning to associate 90° move- 
ments with spoken stimulus-response pairs. 
The control group learned nonsense syllables as 
‘nonrelevant verbal pretraining. Each of the 
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groups was then divided up into five subgroups, 
with each subgroup tracing patterns made up of 
one of five angles. 

A logarithmic transformation was performed 
on the time scores because of heterogeneity of 
variance. ‘The results of the subsequent analysis 
show a differential effect of pretraining for the 
different angles, with a slight inhibitory effect 
appearing for angles 15°, 30°, and 45° and a 
facilitory effect appearing for angles 60° and 75°. 
An analysis by orthogonal polynomials indicated 
significant linear and quadratic trends for the 
experimental group data. Performance is also 
found to be a function of angle of movement for 
the control group. A curve fitted by orthogonal 
polynomials was found to have significant linear, 
quadratic, and quartic components 

It is concluded that transfer from verbal pre- 
training to a mirror tracing task varies directly 
as a function of similarity of responses made 
during pretraining. It is also concluded that 
performance varies as a function of angle of 
movement. 
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In listening to speech, one typically 
reduces the number and variety of the 
many sounds with which he is bom- 
barded by casting them into one or 
another of the phoneme* categories 
that his language allows. Thus, a 
listener will identify as b, for example, 
quite a large number of acoustically 
different Although these 
differences are likely to be many and 


sounds. 


various, some of them will occur along 
an acoustic continuum that contains 
cues for a different phoneme, such as 
d. This is important for the present 
study because it provides a basis for 
the question to be examined here: 
whether or not, with similar acoustic 
differences, a listener can better dis- 
criminate between sounds that lie on 
opposite sides of a phoneme boundary 
than he can between sounds that fall 
within the same phoneme category. 
There are grounds for expecting an 
affirmative answer to this question. 
The most obvious, perhaps, are to be 


found in the common experience that 
in learning a new language one often 


' This research was supported in part by the 
Carnegie Corporation of New York and in part 
by the Department of Defense in connection 
with Contract DA49-170-sc-1642. 

Now at Bell Telephone 
Murray Hill, N. J. 

* The phoneme is most often taken to be the 
smallest unit of that itself, 
distinguish one from another as to 
meaning. ‘Thus, the existence 
bill and dill makes it clear 
different 


Laboratories, 


speech can, by 
utterance 
of the two words 
that b and d are 
English It should be 


emphasized that the phoneme is not a single 


phonemes in 


sound, but is, rather, a class which can and 
usually does include a great many sounds that 
differ from each other in various ways without 


causing any c hange in meaning. 


has difficulty in making all the ap- 
propriate sound discriminations. The 
evidence for this is impressionistic in 
the extreme, and there is little in- 
formation that would permit a defini- 
tion of the more specific aspects of the 
difficulty. In whatever degree this 
difficulty exists, however, a reasonable 
assumption is that some part of it 
arises from the fact that a person who 
is newly exposed to the sounds of a 
strange language finds it necessary to 
categorize familiar acoustic continua 
in unfamiliar ways. If his discrimina- 
tions have, by previous training, been 
sharpened and dulled according to the 
position of the phoneme boundaries of 
his native language, and if the acoustic 
continua of the old language are 
categorized differently by the newgone, 
then the learner might be expected to 
have difficulty perceiving the sounds 
of the new language until he had 
mastered some new discriminations 
and, perhaps, unlearned some old 
ones. 

In more explicit psychological 
terms, an affirmative answer is to be 
expected on the basis that the situa- 
tions being considered here clearly 
meet the conditions for acquired simi- 
larity and acquired distinctiveness. 
If either or both of these processes 
do, in fact, occur, then two speech 
sounds which a listener normally 
lumps into the same phoneme class 
would come to be less discriminable 
than sounds to which he habitually 
attaches different phoneme labels. 
Indeed, one might conceivably find in 
language some and 


very common 
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easily accessible cases in which the 
effects of such processes as acquired 


similarity and acquired distinctiveness 
are as great as many years of practice 
can make them. 

The present experiment was de- 
signed to investigate the relation be- 
tween phonemic labeling 
crimination in one 


dis- 
language and 
within one group of phonemes. For 
this purpose a synthesizer was used to 
generate speech-like sounds and to 
vary them in small steps along an 


and 


acoustic continuum known to contain 
important cues for the perception of 
the voiced stops, 6, d, and g. When 
asked to label these 
sounds as b, d, or g, they normally 
tend by their responses to divide the 
continuum into three sharply defined 
phoneme categories, the shifts from 


listeners are 


one response, or phoneme label, to 
another being very abrupt. It was 
the purpose of this experiment to 
determine how well these same sounds 
can be discriminated, and, in par- 
ticular, to see whether the discrimina- 
tion functions have sharp inflections 
that correspond in position to the 
abrupt shifts (i.e., phoneme bound- 
aries) in the labeling responses. In 
addition, an attempt has been made 
what extent the 
discrimination and 
labeling has here been reduced to its 
theoretical limit. For that purpose, 
the obtained discriminations have 
been compared with a function that is 
computed from the labeling data on 
the extreme that the 
listener cannot differences 


to determine to 


relation between 


assumption 
hear any 


among these sounds those 


that 
phonemic labels. 


beyond 


are revealed by his use of the 


Metuop 


Apparatus.—A_ special-purpose instrument, 


called a pattern playbac k, was used to generate 


the stimuli of this experiment. This instru 


ment, which has been described in earlier papers 
(1, 2, 3), converts hand-painted spectrograms 
» synthesize 

veech-like auditory patterns and to control the 
8 


into sound, thus making it possible t 


various aspects of the pattern quite precisely 

Stimuli illustrates the 
produce the stimuli The 
ulus variable is the direction and extent of the 


Figure l spectro 


grams used to stim 
second-formant transition, this variable having 
been found prev iously (6) to be important for the 
In the 


f the top 


perceived distinctions among 6, d, and ¢ 
stimulus pattern at the extreme left 
row of the figure, the second formant rises from 
a point 840 cps below its steady-state level, and 
in the pattern at the extreme right of the bottom 
row it falls from a point 720 ps above the stead) 
state Between these two extremes, the starting 
steps of 120 cy 

Por convenience these stimuli will be referred 
to by number, from | through 14, as 


in Fig. 1 


point of the transition varies in 
indicated 


of the first formant had 

to be a marker for the 

class of voiced stops (4) and, 

Fig. 1, this first-formant transition is constant in 

all the stimuli. In the steady-state part of the 

pattern the first formant centers at 340 eps and 
20 cps } 


frequencies produce a synthetic appr 


The rising transition 
been found previous! 


as can be seen in 


the second at ormant at these 
imation 
to the vowel ¢ (as in gate) 

The spectrograms were converte 1 into vund 
on ‘the playback 
tape By copying 


pattern and recorded mn 


magnet cutting, and 
tape, all the 
arrangements described below were made 
and 5S The 


were presented to Ss in two ways: 


splicing the magnetic stimulu 


Stimulus presentation stimul 
singly, to 
determine how Ss would label them as b, d, or g¢, 


ABX 


what extent Ss could discriminate them on any 


and in an arrangement to determine to 


basis at all. 


al alalaladete 














Fic. 1} 
patterns from which the stimuli of the experi 
Pattern 14, at the lower 
right, is complete in all respects 


Illustrations of the spectrographic 


ment were pr duced 
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For the labeling part of the experiment, six 
magnetic tapes were prepared, each of which 
contained a different randomization of the 
entire series of 14 stimuli. There was a 6-sec. 
interval between stimuli. These tapes were 
presented to Ss with instructions to judge each 
stimulus as b, d, or g. 

To test discrimination by the ABX procedure, 
the stimuli were arranged in triads, each of which 
consisted of an A stimulus, a B stimulus, and a 
third stimulus, X, which was identical either to 
A orto B. The Ss were instructed to determine 
whether X was the same as A or the same as B. 
It was strongly emphasized to each S that he 
was to make his judgment on the basis of any 
cues he could hear. The measure of dis- 
criminability is, of course, the percentage of the 
time that the X stimulus is correctly matched to 
A or to B. 

It was not the purpose of this experiment to 
obtain actual DL’s, but only to measure relative 
discriminability at every step on the continuum. 
For that purpose, the A and B stimuli of the 
ABX triad were made up by pairing each stim- 
ulus with the stimulus one step, two steps, and 
three steps removed from it. Each stimulus 
was paired in this fashion with the stimuli lying 
to its right on the continuum shown in Fig. 1. 
Thus, Stimulus | was paired with Stimulus 2 to 
form the A and B stimuli, respectively, of one 
ABX triad. Stimulus | was paired with Stim- 
ulus 3 in another triad, and with Stimulus 4 
in a third, Similarly, Stimulus 2 was paired 
with Stimulus 3, with Stimulus 4, and with 
Stimulus 5, and so on for the remaining stimuli 
on the continuum. There was no stimulus one 
step to the right of Stimulus 14, so 14 does not 
appear as an A stimulus in the one-step series. 
For analogous reasons, Stimuli 13 and 14 do not 
appear as A stimuli in the two-step series, and 
Stimuli 12, 13, and 14 are missing from the three- 
step series. 

From the above discussion, it follows that the 
number of A and B combinations was 36. For 
each combination of A and B stimuli, there were 
two triads—one in which X was identical to A 
and one in which X was identical to B. The 
total number of triads was 72. These triads 
were arranged into six tapes of 12 triads each. 
Within each ABX triad, the stimuli were spaced 
at l-sec. intervals; successive triads were 
separated by 10 sec. 

There were two groups of Ss in the experi- 
ment. Since the procedures for the two groups 
were slightly different they will be described 
separately. Group I consisted of five paid 
volunteers, all undergraduate students at the 
University of Connecticut, who had never 
heard any synthetic speech prior to this experi- 
ment. For the first 17 sessions, Ss were given 


only the ABX discrimination task to do, without 
being told that these were synthetic speech 
sounds. Beginning with Session 18, they were 
informed that the sounds on the discrimination 
tapes were synthetic approximations to speech. 
At this point the labeling tapes were introduced, 
and for the next three sessions Ss were given 
four labeling tapes per session with instructions 
to identify each stimulus as a speech sound. 
During this period they were given no dis- 
crimination task. At Session 22 and thereafter 
Ss were asked to identify the stimuli on the 
labeling tapes as b, d, or g, and the discrimination 
task was resumed. In these sessions, the 
discrimination task was always undertaken after 
Ss had finished judging the stimuli on the 
labeling tapes. 

For several reasons the discrimination data 
obtained before and after the introduction of the 
labeling tapes have been combined. First, an 
examination of the results showed that there 
was no obvious change in the discrimination 
judgments following the introduction of the 
labeling tapes and the instruction to judge the 
stimuli on those tapes as b, d, or g. Second, in 
the sessions in which the labeling tapes were 
first introduced, Ss reported that they had previ- 
ously heard the sounds on the discrimination 
tapes as speech. Moreover, when they were 
asked to identify the speech sounds they heard, 
they responded mostly with 5, d, and g. 

When the discrimination data are combined, 
there are, for each S in Group I, a total of 21 
judgments of each ABX triad. Since there were 
two triads for each combination of A and B 
stimuli, the total number of judgments of each 
A and B combination by each S was 42. For 
each of the stimuli on the labeling tapes there 
are, for each S, 32 judgments. All of the label- 
ing judgments were obtained after Session 21— 
ie., after Ss had been asked specifically to 
identify the stimuli on the labeling tapes as 5, 
d, or g. 

Two of the five Ss in Group I were eliminated 
because they failed to apply the phoneme labels 
consistently. Since these Ss did not clearly 
divide the stimulus continuum into phoneme 
categories, one cannot compare their discrimina- 
tion of speech sounds within and across phoneme 
boundaries. It should be noted here that the 
stimuli of this experiment have previously been 
presented to large groups of listeners, and that 
it is quite unusual to find as many as two out of 
five who are as unreliable in their responses as 
the two Ss who were eliminated from this 
experiment. 

Group II consisted of four workers at the 
Haskins Laboratories. These Ss had previously 
had extensive experience in listening to synthetic 
speech sounds. The procedure for Group II was 
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in terms of percentages are based on 32 and 42 judgments for the labeling and discrimination data, 


respectively. 


like the procedure for Group I after Session 21. 
Thus, in each session, Ss identified the stimuli 
on one labeling tape, and then judged three of 
the discrimination a total of 36 ABX 
(This not be followed 
exactly in the last few sessions due to scheduling 
difficulties.) In all, each S identified each of the 
stimuli on the labeling tapes 25 
\BX 
Since there were two triads for each 
each S judged each 
bination a total of 26 times 

Of those Ss whose data will be shown in- 
dividually in the following section on results, 
CD and RV are from Group I, while LG and 
KH are from Group il 


tapes 


triads plan could 


times, and he 
triad a total of 13 times. 
\ and B 
\ B com- 


judged each 


combination, 


RESULTS 


In order to describe the method of 
presenting the to 
indicate in general terms the outcome 
of this study, the results that were 
obtained with one S, CD, will be 


data, and also 


presented first. At the upper left of 
Fig. 2 are plotted the labeling re- 
sponses made by this S when the 14 
stimuli were presented to him one at 
a time and in random order for judg- 
ment as b, d, or g. It can be seen 
that Stimuli 1, 2, and 3 were identified 
primarily as b, Stimuli 5, 6, 7, 8, and 9 
as d, and Stimuli 11, 12, 13, and 14 
as g. The shifts from one response 
to another are very abrupt, which is 
to say that the phoneme boundaries 
for this S are very sharp and stable. 
The discrimination obtained 
with the same S, CD, are also shown 
in Fig. 2. Only the “obtained” data 
points indicated by open circles and 
connected by solid lines will be con- 
sidered at this time. Each point rep- 
resents the percentage of correct 
responses for all ABX: presentations 


data 
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(both ABA and ABB) when the A 
stimulus had the value shown on the 
abscissa and the B stimulus was one, 
two, or three steps removed (for the 
one-, two-, and three-step curves, 
respectively). Thus, for example, the 
first point on the one-step curve 
shows that this S correctly discrimi- 
nated Stimulus | and Stimulus 2 (one 
step higher on our stimulus scale) 
54% of the time. 

A comparison of the discrimination 
functions with the labeling functions 
indicates that, other things equal, this 
S does, indeed, discriminate better 
between stimuli that lie on either side 
of a phoneme boundary than he does 
between stimuli that fall within the 
same phoneme category. For ex- 
ample, it can be seen from the labeling 
curves that Stimuli | and 3 were both 
identified as b almost all of the time, 
and the two-step discrimination curves 
show that this S correctly discrimi- 
nated these stimuli only 55% of the 
time. ‘Two steps beyond Stimulus 3, 
which was almost always identified as 
b, is Stimulus 5, which was always 
identified as d. The discrimination 
curves show that these two stimuli, 
consistently labeled as different pho 
nemes, were correctly discriminated 
100% of the time. 

It must, of course, be supposed that 
if a listener can always identify two 
stimuli as different (as in the case of 
Stimulus 3 and Stimulus 5), he will 
surely be able to discriminate them. 
On the other hand, it does not neces- 
sarily follow from the fact that he 
identifies two stimuli as the same 
phoneme (as with Stimulus 1 and 
Stimulus 3) that he cannot discrimi- 
nate them. One might think that he 
would hear two types of b—i.e., that 
he would hear what the linguist calls 
allophonic variations. In 
ample cited, S does not. 


the ex- 
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Clearly, the data obtained with this 
S are not all so neat and striking as 
the particular examples chosen, and 
some of the other Ss were more vari- 
able, especially in their responses to 
the discrimination task, than the one 
S, CD, whose responses have been 
shown in Fig. 2. It is, nevertheless, 
reasonably apparent from an inspec- 
tion of the data of all Ss that the 
discriminations tend to be relatively 
more acute in the vicinity of phoneme 
boundaries than in the middle of 
phoneme categories. Before present- 
ing these other data, however, it is 
desirable to provide a basis on which 
all the results can be evaluated. For 
that purpose the working hypothesis 
will be stretched to a theoretical limit 
and its quantitative implications will 
be developed. 

Make the extreme assumption that 
S can discriminate the stimuli only 
to the extent that he can identify 
them as different phonemes. Then 
suppose that in the discrimination 
task—i.e., when S is presented with 
the stimuli in ABX ‘fashion and asked 
to say whether X is like A or like B 
he can only assign the phonemic 
labels 6, d, and g to the individual 
stimuli, and that he has no other basis 
for discriminating among the stimulus 
members of the various ABX triads. 
One can, then, use S’s responses in the 
phonemic labeling part of the experi- 
ment as a basis for calculating the 
frequency with which he will cor- 
rectly discriminate in any given ABX 
arrangement of the stimuli. To do 
this, one must first refer to the labeling 
part of the experiment and take 
account of the relative frequency with 
which S identified each of the 14 
stimuli as b, d, or g. It is, of course, 
possible to go from these data to 
calculations of the probabilities that a 
given ABX triad will be heard as any 
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one of the possible sequences of the 
three phonemes. One then needs 
only to determine for each triad which 
of the possible phonemic sequences 
will lead to responses that would be 
counted as correct discriminations. 


By reference to the phonemic labeling part 
of the experiment we first determine for each 
A and B stimulus of the various ABX triads the 
relative frequency with which it was identified 
as b, d, and as g. ‘These relative frequencies 
will be used as estimates of the probabilities of 
hearing the various stimuli of the ABX triads as 
b, as d, and as g. Let us call the probabilities 
of hearing these phones fp», pa, and p, in the case 
of an A stimulus and py’, pa’, and p,’ in the case 
of a B stimulus. We assume next that the 
various stimuli within each triad are perceived 
independently of each other. It follows, then, 
that the probability (for a given ABA triad) 
of hearing a particular sequence, such as b, d, g, 
is pe pa’ py. Since there are three alternative 
responses, there are 3X 3 X 3, or 27, such 
phoneme sequences possible. 

For any given triad the 27 possible phonemic 
divided into three 
whether they lead S to 

would be 


classes 
according to make 
responses that counted as correct 
discriminations, as incorrect discriminations, or 
as discriminations that would on the average be 
correct half of the time and incorrect half of the 
time. In the case of any ABA type of triad, for 
example, S will be correct for any sequence in 
which the first and third stimulus members of 
the triad are heard as the same phoneme and the 
second member is heard as a different phoneme. 
He will be incorrect (again for the ABA type of 
triad) whenever the second and third members 
are heard as the same phoneme and the first is 
He will be 
correct half of the time and incorrect half of the 
time with two types of phoneme sequences: 
(a) all those in which he hears the first and 
second stimuli as the same phoneme and 
(b) those sequences in which he hears three 
different phonemes.‘ 


sequences can be 


heard as a different phoneme. 


*When S hears a sequence of three different 
phonemes, he can be expected to make the 
correct discrimination half of the time only if 
we assume that he perceives each phoneme as 
equally like the other two. It is possible that 
the typical listener does not perceive b, d, and g 
in precisely this way. However, the labeling 
data of this experiment are such as to produce 
essentially zero probabilities of hearing such 
sequences in the ABX triads. We have, there 


363 


These considerations can be expressed quan- 
titatively in the following way. Let P Corriama) 
be the proportion of the time that the listener 
is correct on a number of presentations of the 
same ABA sequence, Pz be the proportion of the 
time that the listener heard a sequence which 
would lead to correct discrimination, Pw be the 
proportion of the time that the listener heard a 
sequence which would lead to incorrect dis- 
crimination, and P; be the proportion of the 
time that the listener heard a sequence which 
would, with equal likelihood, lead to correct and 
incorrect discriminations. Then 


P Corriana) = 1Pe + OPw + SPy. 


If the probabilities of the particular se- 
quences described above are substituted ap- 
propriately for the general expressions Pa, Pw 
and P;, and if the resulting equation is then 
manipulated algebraically, we obtain 


P Corriapa) 


_ tht pa + pa? — pope’ — papa’ — PoPo’ 
4 


So far, we have been concerned only with the 
case in which the presented sequence is ABA; 
analogous considerations lead to a 
ABB, although the particular 
sequences which are correct and incorrect are 
different. 


similar 
equation for 


P Corr ABB) 
the + pa? + Pe — pope’ — papa’ — PoPo 
3 . 
The ABA and ABB sequences were presented 


equally often; therefore, an average P Corr is 
given by 


P Corrmn dt (Po= Put + (ta~ Bal )*+ (Po~ Po’)® 
7 4 ‘ 


The solid points connected by 
dashed lines in Fig. 2 represent the 
discrimination values derived from 
CD’s labeling curves on the assump- 
tions outlined above. It is seen in 
the case of this S that the predicted 
functions do indeed take care of some, 
although by no means all, of the 


fore, not yet attempted to determine how the 
similarities among 6, d, and g are perceived, 
because a correction, no matter how large, would 
have negligible effect on the results being re- 
ported here. 
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KH Kach of the obtained value 
for LG and KH 


variations in the obtained discrimina- 
tions. ‘The assumptions predict the 
points of high and low discrimination 
reasonably well, but they lead one to 
expect a general level of discrimina- 
tion slightly lower than that obtained. 

In Fig. 3 are daga obtained with 
three other Ss. 
presented as a way of indicating the 
results for all Ss but only to show 


These data are not 


additional details of the results, in 
cluding in particular a sample of the 
individual differences among Ss. It 
is seen that the position and number 
of the peaks in the predicted dis- 
crimination functions vary somewhat 
from one S to reflecting 
differences in the way Ss had assigned 
phoneme labels to the stimuli when 
they were presented for identification. 
It is also apparent that the obtained 


another, 


Predicted and obtained discrimination values at two and three steps for RV, LG, and 
is based on 42 judgments in the case of RV, and on 26 judgments 


follow the inter-S 
differences in the predicted functions 
fairly well. 


The simplest way to summarize the 


discriminations 


data for all Ss is to make a scatter plot 
of obtained values against predicted 
Such plots are shown in 
Fig. 4, 5, and 6, for one-, two-, and 
three-step 


values. 


discriminations, 
For the two-'and three-step 


respec- 
tively.® 
data, regression lines have been fitted 
by the method of least squares, and 
these are shown for each set of points. 
The regression for the one-step data 
has not been determined because so 


§ It should be noted here that the reliabilities 
of all points in the scatter plots are not equal, 
since, as was pointed out under the section on 
procedure, the two groups of Ss made different 
numbers of judgments of the various ABX 
triads. 
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Fic. 4. Scatter plot of predicted vs. obtained 
values on the one-step discrimination for all Ss 
The small numerals have been placed on the 
graph to indicate, where necessary, the number 


of values that occupy the same position of the 


coordinates 


few predicted points lie above 50°, 
that a meaningful fit cannot 
tained. 


be ob- 
The one-step data do not, 
therefore, provide a good test of our 
assumptions. [or the two- and three- 
step data we should, of course, sup- 
pose that if the obtained data were 
essentially as predicted, give or take 
a little experimental error, the regres- 
sion lines would be described by the 
equation “x = y.” 

The relationship between predicted 
and obtained values has been meas- 
ured by tau,® a 
parametric measure of correlation 
The correlations are +.14, +.43, and 
+.43 for the one-, two-, and three- 
step 


computing non- 


discriminations, respectively. 


Significance levels, in the same order, 


6 We have 


because our data fail t 


used a nonparametric measure 


» meet the assun ption of 
h ymoscedasticity For a descr ption of tau, 


ce Kendall (5). 
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are P= <.08, P= <.001, and 
P = <.001. Thus, it is seen that the 
correlations are highly significant for 
the two- and three-step discrimina- 
tions, but in the case of the one-step 
discrimination the relationship could 
have arisen by chance. The failure to 
obtain significance for the one-step 
discrimination is not surprising since, 
as was pointed out earlier, most of the 
one-step points are predicted to lie 
at 50%. 

Although there is a significant 
relationship between obtained and 
predicted points for the two- and 
three-step discrimination data, it is 
apparent in both cases that the lines 
of best fit are systematically displaced 
upward, This indicates that while 
the assumptions predict fairly well the 
occurrence and location of the inflec- 
tions in the discrimination curves, 
they apparently lead to an under- 
estimation of the general level of 
discrimination. This was previously 
noted in the data of the individual Ss 
as shown in Fig. 2 and 3, where it was 
seen that the obtained discrimination 
functions tended to fit quite well with 
the predicted curves except that they 
were in general at 4 slightly higher 
level. 


It is difficult on the basis of the data 
now available to make an unequivocal 
interpretation of the difference in level 
between obtained and predicted dis- 
crimination functions. One possibility, 
of course, is that this discrepancy repre- 
sents a margin of “true” discrimination 
—i.e., an ability of § to distinguish the 
speech sounds, not solely on the basis of 
the phonemic labels, but also more di- 
rectly by the essential acoustic differ- 
ences among the patterns. A very dif- 
ferent possibility is that the discrepancy 
between obtained and predicted is the 
result of certain detailed aspects of the 
experimental procedure. For example, 
irrelevant discriminable aspects of the 


stimuli, such as accidental stray noise, 
could have provided Ss with an extrane- 
ous basis for deciding, in regard to the 
ABX triads, whether X was A or B. 
Such additional stimuli would, of course, 
have had no effect on Ss’ responses in the 
phonemic labeling part of the experiment. 
The result would have been, then, to 
make the obtained discriminations some- 
what better than one should have ex- 
pected them to be. In general, the pro- 
cedures of this experiment were not such 
as to control most effectively for these 
irrelevant discriminanda. It will be 
possible in future research to take greater 
precautions against their occurrence, and 
then to determine whether and to what 
extent the discrepancy between obtained 
and predicted is reduced. 

Within the data of the present experi- 
ment, however, there is some evidence 
that the discrepancy between obtained 
and predicted discrimination functions 
is due, at least in part, to “true” dis- 
crimination. This evidence is to be 
found in the fact that the discrepancy 
would appear to be greater for the three 
step than for the two-step function. One 
might expect to obtain this difference 
if § were truly discriminating the stimuli 
on the basis of their essential acoustic 
characteristics, since one would then 
presumably find as between the two- and 
three-step discriminations that the three- 
step discrimination was the easier. 
Other factors, such as the irrelevant dis- 
criminanda discussed in the preceding 
paragraph, would have affected the two- 
and three-step conditions equally, and 
would not have caused the departure 
from predicted values to be greater in 
the three- than in the two-step data. 


Discussion 


The results of this experiment cannot 
be assumed unequivocally to reflect the 
effects of learning on discrimination. 
There is, of course, the possibility that 
the inflections in the discrimination func 
tion are given innately, and that the 
phoneme boundaries have been placed 
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so as to coincide with these discontinui- 
ties. This begins to seem unlikely when 
one considers that other languages have 
put phoneme boundaries at different 
places along the 4-d-g continuum. In 
itself, this would appear to reduce the 
probability that the sharp inflections in 
the discrimination functions are innately 
given. One would have far more com- 
pelling evidence, however, if it were 
found that native speakers of such lan- 
guages have their points of maximal and 
minimal differential sensitivity displaced 
along the continuum to correspond with 
the phoneme boundaries of their re- 
spective languages. Research to test 
this possibility is feasible. 

In order to find out whether the effects 
here described represent acquired simi- 
larity, acquired distinctiveness, or a com- 
bination of both, it will, of course, be ap- 
propriate to obtain discrimination data 
on nonspeech stimuli that are otherwise 
identical with the synthetic approxima- 
tions to speech. Complete identity is 
obviously impossible, unless one can get 
Ss to hear the same sounds sometimes as 
speech and sometimes not. One can, 
however, reasonably expect to make re- 
vealing comparisons between discrim- 
ination data obtained with speech and 
nonspeech stimuli that vary along the 
same, simple stimulus dimensions. 
Sheer temporal duration, for example, is 
sometimes a cue for distinguishing speech 
sounds. (Duration of transitions dis- 
tinguishes stop consonant from semi- 
vowel; duration of fricative noise is a 
cue for distinguishing among the classes 
fricative, affricate, and stop.) It will 
be possible to obtain discrimination func- 
tions for variations in duration when 
those variations cue the perceived differ- 
ences among phonemes, and to obtain 
comparable data for variations in dura- 
tion of nonspeech sounds. We should 
suppose that a comparison of discrimina- 
tion functions such as these would help 
greatly to determine whether the typical 


listener’s long training in speech per- 


ception has served selectively to sharpen 
or to dull his discrimination of speech 
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sounds, or whether, perhaps, it has done 


both. 
SUMMARY 


This experiment was designed to measure the 
relation between Ss’ phonemic identifications 
(as b, d, g) of certain synthetic speech sounds 
and the extent to which they can discriminate 
the sounds as being different in any way. The 
stimuli were two-formant approximations to 
consonant-vowel syllables. ‘They varied in the 
extent and direction of the second-formant 
transitions, this variable having previously been 
found to be an important cue for the perceived 
distinctions among }, d, and g. 

In one part of the experiment these sounds 
were presented singly and in random order for 
identification as 6, d, or g. The responses 
obtained in this way tended, with most listeners, 
to divide the stimulus continuum into three 
sharply bounded phonemic categories, indicating 
that the perceived shifts from one phoneme to 
another were rather abrupt. 

In a second part of the experiment an ABX 
procedure was used to measure Ss’ ability to 
discriminate these The results in 
dicated that discrimination was better at the 
phoneme boundaries than in the middle of the 
phoneme That is, with acoustic 
differences equal, Ss discriminated better be 
tween speech sounds to which they habitually 
attach different phonemic labels than they did 
between sounds which they normally put into 
the same phoneme class. ‘To obtain a more 
nearly precise evaluation of this effect, the 
obtained discrimination curves were compared 
with those that would be predicted from the 
phoneme identification curves on a most extreme 
assumption: namely, that S can only hear those 
differences that are revealed by the way in which 
he attached phonemic labels to 


The discrimination curves that were produced on 


sounds. 


category. 


the stimuli. 


this assumption predicted fairly well the occur- 
rence and location of points of high and low 
discriminability in the obtained functions, but 
Ss tended in general to discriminate the stimuli 


somewhat 


better than expected from the 


extreme assumption 
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RETENTION 


OF VERBAL ASSOCIATES AS A FUNCTION OF 


STRENGTH ! 


WILLARD N. RUNQUIST? 


Northwestern University 


This study was concerned with some 
basic relationships between interfering 
responses and forgetting of items in a 
the 
retroactive and proactive inhibition 
designs, where thé sources of interfer- 


paired-associate verbal list. In 


ence can be at least partially specified, 
retention has been assumed to be a 
function of the relative strengths of 
the competing responses at the time of 
recall. 
are the studies varying degrees of 
first-list learning (6, 14), degree of 
second-list learning (1, 7, 8, 9, 11, 12), 
or both (3), which have shown an 
increase in retroactive inhibition (RI) 
and proactive inhibition (PI) as the 
degree of learning of the interfering 
list increases and a decrease in RI as 
the degree of learning of the recalled 
In this study an at- 
tempt was made to analyze the reten- 
tion of the individual items within the 
list as related to their strength. 

Two 
The first of these was the relation be- 


Supporting this assumption 


list increases. 


relationships were studied. 
tween retention and the strength of 
the 
two responses are learned to the same 


Recall of both first- and 


second-learned responses was studied 


the competing responses when 
stimulus. 


indicated 
differences in recall due to the tem- 


since some evidence has 
poral position of the interfering learn- 


ing (13). The 


vestigated was 


second relation in- 
that for- 
getting and item strength when there 


between 


1This study is abstracted from a doctor’s 
lissertation submitted to the Graduate School, 
Northwestern University. The 
debted to Dr. Benton J. Underwood for his 
supervision 

2 Now at the State University of Iowa. 


author is in 


was no specific interfering learning. 
It is known at a gross level that re- 
sponse strength influences retention, 
the the 
relationship is not known. 


although exact nature of 


Metuop 


Three 


learned lists of eight paired adjectives 


of 104 Ss each 
Only Ss 


who had not served in verbal learning experi 


> , 
Procedure groups 


ments were used in order to reduce proactive 


interference from outside this particular experi 
ment to a minimum 
were taken fr 


Underwood (13) 


ntained 10 pairs so two 


Four sets of two lists 
previous study by 
lists originally re 
were dropped arbitrarily from each list 
within a set were identic al, 
different. Intralist 
acl S 


an equal 


stimuli of the two list 
with the 
and interlist 


responses be ing 


similarity were low 


learned lists from a single set with 


number of Ss in each group learning each list in 


The 


lists was to avoid biasing the re 


each set first purpose of using several 
ilts with specific 
three 
groups were designated RI, PI, and Control 
The control group learned and recalled a single 


list, while the RI and re learn 
two lists of a set | I 


item effects as much as possible The 


group recall ne the first list and the grouy 
the second list A‘recall trial and four addi 
ufter 24 hi 
m to 


tional relearning trials were given 
careful instruct 
insure that Ss knew exactly which list they 
to recall in the RI and PI conditions 
Lists were presented on a Patterson 


Memory 


intertrial 


An attempt was made by 


were 


S-PA 


, rate with a 4-see 


Drum at a 2:2-se 
interval All learning was by the 
anticipation method to one perfect trial plus two 
additional trials. Five orders of 


of the pairs were used to minimize ser 


presentation 
al learning 
There are 
strength of 
The m t 


n terms of the number of correct 


Measurement of response strength 


7 
several way in which the re SpPorise 
the items could be dimensionalized 
obvious way i 
(reinforcements) on each item 
If the eight items for each S are 


according to the 


anticipations 
rane 
number of reinforcements, 


these ranks can be used as an ordinal dimen 
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sion of response and 


items for 


strength retention of 
rank can be plotted. This 
method of dimensionalizing strength has the 
advantage of having each S contributing one 
item to each point on the dimension. 


each 


It was 
selected over several other possible methods 
because the alternative methods appeared to 
give biased of the of for- 
getting. 

If the raw percentage of items recalled for 
each rank were used as a measure of forgetting, 
this would not give a true picture of this relation- 
ship 


estimates amount 


If, for example, recall were given im- 
mediately by having an additional learning trial, 
the items of Rank | (large number of reinforce 
ments) would be correctly 
Rank &. ‘Thus, a measure of 
forgetting must only take into account the loss 
of items beyond that obtained with immediate 
recall if it is not to reflect merely the learning 
differences. ‘This problem has been more fully 
discussed elsewhere (15). An estimate of the 
expected percentage recalled on the trial im 
mediately following the completion of learning 
may be 


recalled more often 


than items of 


obtained by successive 
analysis (15). 


taking each 


probability 

Briefly, this method consists of 
item for each S and tracing it 
through the course of learning noting whether 
it was correct or incorrect following each rein 
forcement. By pooling the data over items and 
Ss, the percentage of items correct on the next 
trial following 1, 2, 3, ete. 
obtained. 


reinforcements is 
These percentages can be used as 
estimates of the probability that a certain item 
will be correctly anticipated on an immediate 
recall trial by noting the total number of rein- 
forcements and looking up the value of the 
percentage. 

The more homogeneous the items which are 
pooled, the more accurate will be this estimated 
probability. probability 
analyses were performed as described above for 
fast and slow learning Ss as determined by 
dividing the distribution of trials to criterion at 


Therefore, separate 


the median. Separate analyses were also 
performed within each category for items of 
Ranks | and 2 combined, Ranks 3 and 4combined, 
Ranks 5 and 6 combined, and Ranks 7 and 8 
Thus the probability 


(percentage) of an item being correctly antici- 


combined. predicted 


pated at immediate recall was determined only 
from items of the same approximate strength 
from Ss with approximately the same speed of 
learning. 


Retention was measured in terms of absolute 
The obtained 
percentage recalled for each rank was subtracted 
from the predicted percentage. The predicted 
percentage was obtained by averaging the pre- 
dicted probabilities for each item obtained from 


loss in probability or percentage. 


WILLARD N. 


RUNQUIST 


the analysis described above. Whether or not 
this gives an accurate picture of forgetting de- 
pends upon certain assumptions concerning the 
relationship of forgetting to strength. By using 
the absolute loss, the assumption is made that 
Thus a 50% loss would 
be treated as an equal amount of forgetting 
whether the predicted recall was 90% or CO%. 
Another and possibly intuitively more plausible 
method, would be to measure forgetting in terms 
of relative amount forgotten. This is done by 
taking the percentage of the expected recall 
which is obtained. In these data, the results 
obtained by both methods were almost identical, 
hence the absolute measure will be used as it is 
more convenient to handle statistically. 


this relation is linear. 


Resutts 


Retention of items in a single list. 
In determining a relationship between 
response strength and forgetting, only 
the data from the control group are 
considered. The mean number of 
items recalled per S after 24 hr. was 
5.83 + .12 which represents 73% 
recall. The absolute loss in prob- 
ability for the eight ranks is shown in 
Fig. 1 with Rank 8 representing the 
weak items and Rank | the strong. 
The mean predicted probabilities for 
the eight ranks were .62, .75, .85, .87, 
.92, .96, .97, and .99 from Rank 8 to 
Rank 1, respectively. While it may 
not be legitimate to space the ranks 
equally, the nature of the relationship 
which shows that forgetting first in- 
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creases then decreases is not affected. 
In fact, the increase in forgetting for 
the weaker items is more extended if 
the peints are plotted with the pre- 
dicted probabilities on the abscissa. 

Statistical evaluation of the rela- 
tionship required obtaining a score for 
each item for each S. This involved 
subtracting the obtained percentage 
recalled from the predicted percentage 
recalled for each item. Since a single 
item is either recalled (100%) or not 
recalled (0%), 100 was subtracted 
from the predicted percentage (prob- 
ability) of each item correctly recalled, 
while zero was subtracted from the 
predicted percentage of each item not 
correctly recalled. Actually, these 
loss scores are artificial in terms of 
percentage. What was desired was to 
assign a value to each item in such a 
way that S would receive more credit 
for getting a weak item correct than 
he would for getting a strong item 
correct. The loss scores are thus 
weighted recall scores for each item. 
The mean of these loss scores for each 
rank corresponds exactly with the 
over-all loss as determined above. 
Since the distributions of loss scores 
were bimodal, a Friedman chi-square 
(10) was used to test for differences 
between the ranks. The resultant 
chi square was 74.07 which was highly 
significant (P < .0OOl for 7df). Fur- 
ther evidence for the reliability of 
this relationship is that the same 
general function has been obtained by 
Underwood in two other independent 
instances, although these data have 
not been published.’ 


In view of this unusual relationship one 
might ask whether some procedural! arti- 
fact might have produced the results. 
If there is such an artifact it could not 
be discovered. Several methods of cate- 
gorizing items as to strength produced 


* B. J. Underwood, personal communication, 


October, 1956. 
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similar shaped functions, once the known 
sources of bias were taken into account. 
These biases were usually in the form of 
either distorted estimates of the pre. 
dicted probabilities or selection of items 
for particular points from certain kinds 
of Ss such as fast and slow learners. 
Therefore, within the limits of this study, 
this appears to be the relationship be 
tween the strength of the association 
and the amount of forgetting. It should 
be pointed out that there is no a priori 
reason why the interaction of interfer- 
ence within the list, which is presumably 
responsible for the differential forgetting 
of various items, should produce the same 
law as when response strength is varied 
by having lists learned to a different cri- 
terion. A systematic study of the in- 
fluence of variables affecting interfer- 
ence, such as similarity, would appear to 
be necessary before a satisfactory ac- 
count of the phenomenon reported here 
could be made. 


RI and PI.—The mean number of 
items recalled under RI and PI condi- 
tions were 3.36 + .13 and 3.41 + .17, 
All items which were in 
the recalled list for these two groups 
were rank ordered in terms of the 
number of reinforcements. on that 
item, and independently on the num- 
ber of reinforcements on the corre- 
sponding interfering item. A _ two- 
way classification was then made of 
all items according to the rank order 
of the recalled item (List | for the RI 
group and List 2 for the PI group) and 
the rank order of its corresponding 
interfering item. Ranks 1 and 2, 3 
and 4, 5 and 6, and 7 and 8 were 
combined, in order to achieve better 
reliability by including more items at 
each point. This gave a4 X 4 classi- 
fication for each group. In order to 
measure the amount of RI and PI at 
each point on the strength dimension, 
it was first necessary to determine the 
loss (forgetting) in probability or 
percentage at each point corrected for 
differences in initial strength exactly 


respectively. 
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ABSOLUTE RI AND PI 


INTERFERING ITEMS 








iT 1 1 
7-6 56 3-4 1-2 


RANK ORDER OF RESPONSE 
Fic. 2. Absolute RI and PI as a function of 


the rank order of the recalled items and the rank 
order of the interfering item. 





as in the control group data. The 
amount of absolute RI and PI was 
then determined by subtracting the 
loss which took place without specific 
interfering learning; i.e., the loss in 
probability for the control group at 
that point. RI and PI calculated in 
this way are plotted in Fig. 2. The 
upper function gives RI and PI as a 
function of the rank of the recalled 
item ignoring the rank of the inter- 
fering items, while the lower function 
gives RI and PI as a function of the 
rank of the interfering item ignoring 
the rank of the recalled item. 

For purposes of statistical evalua- 
tion a score was obtained for each S 
at each point by subtracting the ob- 
tained percentage recalled from the 
predicted percentage recalled, and 
then subtracting the control group’s 
percentage recalled for the same point. 
Since each S contributed two items 


to each point, the predicted per- 
centage recalled was the average of 
that for these two items. The ob- 
tained percentage recalled was either 
100, 50, or zero depending upon 
whether S recalled both, one, or 
neither of the items. Again the mean 
RI and PI computed in this way 
corresponded to the curves obtained 
by the over-all analysis. The RI and 
PI scores were analyzed by means of 
four Friedman chi-square tests. The 
first of these tested RI as a function of 
the rank of the recalled items. The 
chi square was 21.05 which was 
significant beyond the .0O1 level with 
3 df. The second tested PI as a 
function of the rank of the recalled 
item. ‘The chi square was 5.26 which 
was not significant (P = .14). The 
other two tested RI and PI respec- 
tively as a function of.the rank of the 
interfering item. ‘These values were 
£84 and 2.40, neither of which ap- 
proached significance. 

As expected, the stronger the item, 
the less the RI. This result corre- 
sponds quite well with studies in 
which the degree of learning was 
varied by manipulating the number of 
trials. The strength of the item did 
not appear to be an effective variable 
in the amount of PI, although there is 
some suggestion of a decrease in PI as 
strength increases. The difference 
between RI and PI may possibly be 
attributed to a difference in the range 
of total reinforcements. Since there 
was over all some positive transfer in 
learning List 2, the range of rein- 
forcements was less on List 2 than 
List 1. Although there was no differ- 
ence in the predicted probabilities for 
the corresponding ranks between the 
RI and PI groups, there is evidence 


which indicates that there is an 


increase in retention even when prob- 
abilities do not increase any further 
with the number of reinforcements. 
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It is possible that the restriction in 
range of number of reinforcements in 
the PI group was responsible for no 
relationship 


obtained. It is 
impossible to compare this result with 
the amount of PI produced by varying 
numbers of trials on the second list, 
since no studies have been done using 
this variable. Data by Briggs (3) 
shows that Ss given free recall after 
learning two lists to various criteria 
do not show large changes in the 
amount of PI with some degrees of 
first-list learning. 


being 


The fact that strength of the interfer- 
ing item is not related to RI and PI is 
somewhat surprising in view of the vast 
number of studies which show that the 
lower the degree of learning of the inter- 
fering list, the less the RI and PI. The 
result obtained here suggests that the 
differentiation between the two response 
systems may be the most important 
factor in due to interfering 
learning. Support for this notion, also 
comes from recent studies by Bilodeau 
and Schlosberg (2) and Greenspoon and 
Ranyard (5) which show considerably 
less interference when the two lists are 
learned in distinctive environments. 

These functions do not appear to be the 
result of any interaction between items 
of different strengths. When plotted as 
a joint function of the rank of the re 
called and interfering item, RI and PI 
did not show any systematic variation 
or interaction other than the 
ships previously discussed. 
will not be presented. 

The important fact obtained from the 
retention and RI and PI data is that the 
laws obtained between strength of as- 
sociation and retention when strength is 
measured within a list are not the same as 
those obtained when strength is varied 
by giving Ss different numbers of trials 
on the lists as a whole. 


forgetting 


relation- 
These data 


Relearning.-Further evidence for 
a difference in RI and PI occurs in the 
relearning data. The mean number 
of correct responses on each of the five 


relearning trials is plotted in Fig. 3. 
Analysis of variance on the mean total 
correct responses in relearning gave 
an F of 65.66 with heterogeneous 
variances due to the restriction of the 
A t test 
between the RI and PI groups gave 
a value of 2.18 (P 03). The 
difference occurs largely on the last 


range in the control group. 


two trials where the PI group performs 
as well as the control group, while the 
RI group is somewhat retarded. It 
has been the usual finding that sig- 
nificant RI and PI disappear after 
two relearning trials (9, 11, 12, 13, 14). 
The present study gave 
RI throughout the five 
trials and significant PI up to 
fourth trial. 
are not strictly comparable since the 


significant 
relearning 
the 
The studies cited above 
Ss served in all conditions in those 
studies. 

The overt error data suggest that 
this difference in RI and PI may be 
attributed to intralist rather than 
interlist interference. From Table 1, 
that the RI and PI 
groups made more total errors during 
relearning than the control 
(t= 5.19 and 3.05 for 
and Pi Vs. 


it can be seen 
group 
the RI vs 
control, 


control respec- 


es @ ~ 


MEAN C ORRECT RESPONSES 





a 


3 4 
TRIALS 








Fic. 3. 


Mean number of responses correct 


on five relearning trials 
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TABLE 1 


Mean Errors ver S ror tue Five 
ReLearnine TRIALS 
Total Errors 


Intrusions 


| aT. 


Mean 
214 ; 
3.94 F OS 16 
3.13 3 


Control 
RI 
PI 


} 
Group | 
| 


tively). The RI group also made 
more total errors than the PI group 

= 2.26, P = .03), but this differ- 
ence seems to be due to a greater 
number of intralist errors since the 
number of intrusions did not differ 
between the two groups. A further 
analysis of the errors occurring in re- 
learning showed that the RI group 
made a greater number of repeated 
errors and errors previously made in 
original learning, while most of the 
PI errors were intrusions or had never 
occurred before. In terms of Gibson’s 
hypothesis (4) concerning the spon- 
taneous recovery of generalization it 
appears as if the RI group had a more 
difficult time eliminating  intralist 
generalization tendencies recovered 
from the original learning. 


The effects of item difficulty.—In this 
study, the strength of an item and its 
difficulty are, of course, confounded. 
The above relationships, then, should 
best be described in terms of a joint func- 
tion of strength and difficulty. This 
is not crucial as it is precisely what the 
study set out to discover. It was origi- 
nally intended to also analyze transfer to 
the learning of the List 2 items as a func- 
tion of the strength of the List 1 item. 
Several analyses were attempted, with 
the general conclusion being that the 
relationship could not be obtained un- 
ambiguously from these data. This was 
due to a possible correlation between the 
difficulty of the List 1 item and the diffi- 
culty of the List 2 item. This could be 
perhaps due to the greater distinctive- 


ness of some stimulus components. 
Thus any measure of the learning of a 
List 2 item would be confounded with the 
difficulty of that item. This would not, 
however, enter into the recall! data, since 
the interest here is not in the learning of 
the List 2 item, but only in its empirical 
strength at the end of learning regardless 
of how or why it is of that strength. 


SUMMARY 


The purpose of this study was to determine 
the relationship between retention and the 
strength of the association, and the relationship 
between RI and PI and the strength of the 
competing responses. 

Two groups of 104 Ss each learned two paired 
adjective lists with identical stimuli and different 
responses. A third group of 104 Ss learned a 
single list. After 24 hr., all Ss recalled and re- 
learned either the first list (RI) or second list 
(PI). 

An item analysis was performed on the data. 
Items were ranked according to the number of 
reinforcements and retention scores were ob- 
tained for each rank corrected for the strength 
of the response at the end of learning. The 
results showed : 

1. Forgetting of the items in a single list 
increased then decreased as the strength of the 
item increased. 

2. RI was a function of the strength of the 
recalled item, but was not related to interfering 
item strength. PI was not 
recalled item strength or 
strength. 

3. Relearning was faster for the 
than for the RI group. 


related to either 
interfering item 


PI group 
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EFFECT OF VERBAL PRETRAINING 


ON THE ACQUISITION OF A COMPLEX MOTOR SKILL! 


WILLIAM F, BATTIG,A DONALD R. HOFFELD, SIDNEY SEIDENSTEIN, 
AND W. J. BROGDEN 


University of Wisconsin 


Battig (3) found that amount of positive 
transfer from verbal pretraining to performance 
of a finger-positioning task decreased as com 
plexity increased from one to two fingers and 
was zero for the 3- and 4-finger tasks. These 
and other results (2) suggest that relevant 
verbal pretraining has little or no effect on the 
acquisition of complex motor tasks 
relevance of the verbal used in 
previous studies could account for these negative 
results. The attempt to 
increase the degree of relevance of verbal pre 
training in order to test that hypothesis, with 
Battig’s (3) study as the point of departure 

Method.—Battig'’s (3) finger positioning ap 
paratus was used with the 4-finger task. The 
S’s were 36 University of Wisconsin psychology 


Insufficient 
pretraining 
present study is an 


aeeene in part by a grant from the National 


Science Foundation 


*Now at Psychological Laboratory, Universiiy of 
Virginia, 


students assigned at random to three groups of 
I2 each. Fach of three £’s ran four S’s in each 
group. The three groups differed only in type 
of pretraining. Group I, the control group, had 
For Group Il-—the 
pretraining group--the procedure and 
letters were identical to those for Battig’s (3) 
relevant-S group, the difference being that the 
letters could be seen only when the lights were 
lit. For Group III 


training 


no pretraining. 
words 


stimulus 


the response-words pre 
letter of the 
placed over each response light, 


group-—a alphabet was 
so that it was 
seen only when the light was lit by appropriate 
positioning of the response key. The letters 
were the same as for Group II, and S’s task was 
the same: to pronounce nonsense words formed 
from the letters covering the green lights that 
corresponded to the red lights 
pattern 


of the stimulus 
Group III differed from Group II 
primarily in that S could identify the letters only 
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Fic. 1. Mean number of matches per trial 
for each group. Breaks in 
l-min. intertrial rest 


curves represent 


However, S was 
not instructed to match the red and green lights, 


by moving the response keys 


only to move the keys until the correct letters 
could be identified 

The instructions given each S and the condi 
tions and procedure during the experimental 
Zattig (3) 
The mean number of matches per 


session were those used by 

Results 
trial for the three groups is presented in Fig. 1. 
Analysis of variance of scores summed over 20 
trials shows trials and the interaction of trials 
by groups to be the only significant sources of 
vaniation For analysis of variance of the data 
for the first trial and the first 
significant difference between groups occurs in 


five trials, a 


both case The Duncan range test (4) shows 
that on Trial 1, Group III is 
superior to both Groups I and II, which do not 
differ from each other. For Trials 1-5, Group 
III is significantly superior to Group II, but 
neither differs from Group I. 


significantly 


Analysis of vari 
ance of Trials 6-20 yields no significant differ 
ences between groups. 

An Alexander trend test (1) on the 
curves provides further analysis of the Trials 
X Groups interaction 
and 


group 


Significant over-all slope 


over-all deviations from linearity show 


significant nonlinear acquisition. The over-all 
between-groups slope term is significant, but 
pairwise comparisons show only that the slope 
for Group III is significantly less steep than the 
slopes for Groups I and II. 

Discussion.—The pretraining for Group III 
involved a form of motor practice (manipulating 
the response keys in order to view the letters 
on the stimulus lights) that may account for the 
The amount of 
practice in pretraining for Group III is roughly 


initial positive transfer 
equivalent to the amount of practice for Group I 
on Trial 8. <A t test of the difference between 
Group I on Trial 8 and Group HI on Trial 1 
shows Group I to have a statistically significant 
superior performance. The difference between 
Groups II and III is probably due also to the 
added motor practice given Group III during 
pretraining and provides no evidence of a differ 
ence in the effect of the verbal factors involved 
Thus, the results of this 
experiment give little or no evidence for positive 


in the two procedures 


transfer from either type of verbal pretraining 
and support Battig’s (3) conclusion that verbal 
pretraining has no beneficial effect on acquisition 
of a motor task as complex as the 4-finger task 


used here. 
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